AMERICAN 
JOURNAL OF MATHEMATICS 


FOUNDED BY THE JOHNS HOPKINS UNIVERSITY 


EDITED BY 


WEI-LIANG CHOW                      J. A. DIEUDONNÉ
THE JOHNS HOPKINS UNIVERSITY        INSTITUT DES HAUTES ÉTUDES SCIENTIFIQUES

A. M. GLEASON                       PHILIP HARTMAN
HARVARD UNIVERSITY                  THE JOHNS HOPKINS UNIVERSITY


WITH THE COOPERATION OF


L. V. AHLFORS      S. S. CHERN       F. I. MAUTNER
A. BOREL           C. CHEVALLEY      J. MILNOR
H. CARTAN          K. IWASAWA        A. WEIL
                   K. KODAIRA


PUBLISHED UNDER THE JOINT AUSPICES OF 
THE JOHNS HOPKINS UNIVERSITY 
AND 
THE AMERICAN MATHEMATICAL SOCIETY 


VOLUME LXXXIII


1961 


THE JOHNS HOPKINS PRESS 
BALTIMORE 18, MARYLAND 
U. S. A. 


PRINTED IN THE UNITED STATES OF AMERICA 
BY J. H. FURST COMPANY, BALTIMORE, MARYLAND 






Volume LXXXIII, Number 1 
JANUARY, 1961 




CONTENTS


Autosynartetic solutions of differential equations. By D. C. LEWIS.

Calculation of class numbers by decomposition into three integral squares in the fields of $2^{1/2}$ and $3^{1/2}$. By HARVEY COHN.

Points multiples d'une application et produit cyclique réduit. Par ANDRÉ

Finite groups admitting a fixed-point-free automorphism of order 4. By DANIEL GORENSTEIN and I. N. HERSTEIN.

On induced representations. By ROBERT J. BLATTNER.

On Chow varieties of maximal, total, regular families of positive divisors. By J. P. MURRE.

On the algebra of representative functions of an analytic group. By G. HOCHSCHILD and G. D. MOSTOW.

Lineare Gruppen über lokalen Ringen. Von WILHELM KLINGENBERG.

On differential equations and the function $J_\mu^2 + Y_\mu^2$. By PHILIP HARTMAN.

Symmetric products and Jacobians. By ARTHUR MATTUCK.

Correction to "Applications of the theory of Morse to symmetric spaces." By RAOUL BOTT and HANS SAMELSON.

The AMERICAN JOURNAL OF MATHEMATICS appears four times yearly.


The subscription price of the JOURNAL is $11.00 in the U. S.; $11.30 in Canada and $11.60 in other foreign countries. The price of single numbers is $3.00.


Manuscripts intended for publication in the JOURNAL should be sent to Professor W. L. CHOW, The Johns Hopkins University, Baltimore 18, Md.


Subscriptions to the JOURNAL and all business communications should be sent to THE JOHNS HOPKINS PRESS, BALTIMORE 18, MARYLAND, U. S. A.


THE JOHNS HOPKINS PRESS supplies to the authors 100 free reprints of every article appearing in the AMERICAN JOURNAL OF MATHEMATICS. On the other hand, neither THE JOHNS HOPKINS PRESS nor the AMERICAN JOURNAL OF MATHEMATICS can accept orders for additional reprints. Authors interested in securing more than 100 reprints are advised to make arrangements directly with the printers, J. H. FURST CO., 109 MARKET PLACE, BALTIMORE 2, MARYLAND.

The typescripts submitted can be in English, French, German or Italian and should be prepared in accordance with the instructions listed on the inside back cover of this issue.


Second-class postage paid at Baltimore, Maryland. 



AUTOSYNARTETIC SOLUTIONS OF DIFFERENTIAL 
EQUATIONS.* 


By D. C. Lewis.* 


1. Introduction. We are concerned with the differential system 


(1.1)    $dx/dt = f(t, x)$,


where $x$ and $f$ are $n$-vectors and $t$ is the scalar independent variable. We assume that $f$ is defined and of class $C'$ in a suitably chosen region, which we shall not need to specify in detail.

A simple and familiar example of the sort of thing we wish to study occurs when $f$ is periodic in $t$ with period $T$. This situation may be described by saying that the transformation


(1.2)    $s = t + T, \quad y = x$,


takes the equation (1.1) into the form $y' = f(s, y)$, where the accent denotes differentiation with respect to $s$. This is just the original equation in a different notation. If, now, it is known that a particular solution $x(t)$ of (1.1) has the property that $x(0) = x(T)$, it is immediately obvious that $x(t)$ must be periodic with period $T$; in other words, $x(t)$ must satisfy a functional equation,


(1.3)    $x(t + T) = x(t)$,


at least, if $x(t)$ can be defined for all values of $t$.
Suppose now that (1.1) is carried into itself by a more complicated transformation than (1.2), say by the transformation,


(1.4)    $s = P(t, x), \quad y = h(t, x)$,


where $P$ is a scalar and $h$ a vector and both are of class $C'$ in $t$ and $x$. If, now, it is known that a certain solution $x(t)$ has the property that $h(0, x(0))$


* Received April 18, 1960. 

* This research was partially supported by the United States Air Force through the 
Air Force Office of Scientific Research of the Air Research and Development Command, 
under Contract Number AF 49 (638) -382. Reproduction in whole or in part is per- 
mitted for any purpose of the United States Government. 




$= x(P(0, x(0)))$, it will turn out that $x(t)$ must satisfy the functional equation,

(1.5)    $x(P(t, x(t))) = h(t, x(t))$,


at least, if $x(t)$ can be defined for all values of $t$.

Evidently, if $P(t, x) = t + T$ and $h(t, x) = x$, (1.4) and (1.5) reduce respectively to (1.2) and (1.3); so that the situation in which $f(t, x)$ is periodic in $t$ is indeed a special case of what we wish to consider.

An important problem concerning the simpler situation is the perturbation of periodic solutions with respect to a parameter, a problem first considered by Poincaré. We shall show that the main features of this theory of Poincaré and his followers may be extended to cover the perturbation of a solution satisfying (1.5). Such solutions will be called autosynartetic; and we shall present theorems about the degeneracy of an autosynartetic solution, about the associated so-called bifurcation equations, about the influence of certain kinds of first integrals on the degeneracy and on the bifurcation equations, and about phenomena associated with the presence of certain kinds of continuous groups (or semigroups) of transformations which take (1.1) into itself.

For instance, when the system (1.1) is autonomous (i.e. when $f(t, x) = f(x)$ is independent of $t$) and when therefore we may imbed the transformation (1.2) in the continuous group of transformations,


(1.6)    $s = t + T + \lambda, \quad y = x$,


(where $\lambda$ is the parameter of the group) in such a manner that (1.6) equally with (1.2) takes (1.1) into itself, the well known fact, that any non-constant periodic solution of $dx/dt = f(x)$ must be degenerate, is just a simple special case of one of our theorems on general autosynartetic solutions.

Our theories, in addition to applying to the entirely new subject of general autosynartetic solutions, may even yield a few new results about the simpler cases already studied. Thus, for instance, to the best of the author's knowledge, Theorems 8.4, 8.5, and 8.6, specialized to the periodic case, have not appeared in fully developed form in the literature, although partial results along these lines are given in [4] (cf. the References at the end of the paper). Again, in some special non-autonomous periodic case, it might be possible to imbed (1.2) in some 1-parameter transformation group,


$s = P(t, x, \lambda), \quad y = h(t, x, \lambda)$,


with $P(t, x, 0) = t + T$ and $h(t, x, 0) = x$. Our theory then says that any periodic solution of (1.1) must be degenerate, at least, if we impose one further mild condition on the periodic solution in question, which takes the place of the condition mentioned above in the autonomous case that the solution should not be a constant.

We shall also discuss necessary and sufficient conditions that (1.4) 
should transform (1.1) into itself, using a definition which may strike the 
reader as a little peculiar, since it does not require (1.4) to possess an 
inverse. This definition was adopted, since in the major part of the paper 
we never need to assume the existence of an inverse to (1.4). We shall show 
that infinitely many such transformations always exist, in fact, even when 
P(t,x), as well as f(t,x), is prescribed. These transformations are, however, 
determined by the integration of a system more complicated than (1.1) and 
hence cannot be expected to be of any real help in any qualitative discussion 
of the solutions of (1.1). Such help is to be expected only when a particular 
transformation is more or less obvious from the structure of the equations. 
Thus, for example, the equations of celestial mechanics admit both rotational 
and translational symmetry. It often happens, under such circumstances, 
that the functional equation (1.5) merely expresses the fact that $x(t)$ is in
some sense a periodic solution, when suitable coordinates are employed (cf. 
“les trois sortes de solutions périodiques” of Poincaré [5], vol. 1, pp. 95-97). 
At other times it gives us periodic solutions possessing certain special prop- 
erties of symmetry. An indication of how this may occur is as follows: 

Suppose that, in (1.4), $P(t, x) = t + T$, while $y = h(t, x) = h_1(x)$ is independent of $t$ and generates a finite cyclic group of transformations of order $k$. Then the functional equation (1.5) implies that


$x(t + mT) = h_m(x(t))$,


where $h_m$ denotes the $m$-th iterate of $h_1$. Since $h_k$ is the identity transformation, we see at once that, under present hypotheses, our functional equation expresses the fact that $x(t)$ is a certain special kind of periodic solution with period $kT$.
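
The passage from (1.5) to the iterated relation can be written out step by step. The following is only a sketch, assuming as in the paragraph above that $P(t, x) = t + T$ and that $h = h_1$ does not depend on $t$; the induction on $m$ is our own spelling-out and is not displayed in the original text.

```latex
% Assumptions (from the surrounding paragraph): P(t,x) = t + T, h(t,x) = h_1(x),
% h_m = m-th iterate of h_1, and h_k = identity.
\begin{align*}
x(t+T) &= h_1\bigl(x(t)\bigr)
  && \text{(this is (1.5) with } P = t + T,\ h = h_1\text{)},\\
x\bigl(t+(m+1)T\bigr) &= h_1\bigl(x(t+mT)\bigr)
  = h_1\bigl(h_m(x(t))\bigr) = h_{m+1}\bigl(x(t)\bigr)
  && \text{(induction on } m\text{)},\\
x(t+kT) &= h_k\bigl(x(t)\bigr) = x(t)
  && \text{(since } h_k \text{ is the identity)}.
\end{align*}
```

The last line is the statement that $x(t)$ has period $kT$; the intermediate values $x(t + mT)$, $0 < m < k$, are the images of $x(t)$ under the elements of the cyclic group, which is the "special kind" of periodicity referred to.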

Another possibility may arise if $y = h_1(x)$ generates an infinite free group, such that, for any positive $\varepsilon$, we shall have $|h_m(x) - x| < \varepsilon$ for infinitely many values of $m$. We then get a recurrent solution, or, if $h_m(x)$ recurs to an approximation of the identity in a suitably uniform manner, we would get an almost periodic solution.

In spite of these trivialities in the presently contemplated applications, 
the transformations considered in this paper can be of a much more general 
type, well deserving serious study for their own sake. 




2. Some elementary theorems on the transformations of differential 
equations. 


DEFINITION 2.1. To say that (1.4) transforms the system (1.1) into the system,

(2.1)    $dy/ds = g(s, y)$,


means that to every solution $x(t)$ of the system (1.1) there corresponds a solution $y(s)$ of (2.1) such that

(2.2)    $y[P(t, x(t))] = h[t, x(t)]$.


Notice that this definition does not require (1.4) to have an inverse, although certainly one would ordinarily expect an inverse to exist in cases of principal importance.


THEOREM 2.1. If (1.4) transforms (1.1) into (2.1), the functions $h(t, x)$ and $P(t, x)$ must satisfy the vector partial differential equation,


(2.3)    $h_t(t, x) + h_x(t, x) f(t, x) = g[P(t, x), h(t, x)]\,[P_t(t, x) + P_x(t, x) f(t, x)]$,


at every point of the common domain of definition of $f$, $P$, and $h$.


Proof. Differentiate the identity (2.2) with respect to $t$ and use the fact that $dy(s)/ds = g(s, y(s))$ with $s = P(t, x(t))$. We also use the facts that $dx(t)/dt = f(t, x(t))$ and consequently that


$ds/dt = P_t(t, x(t)) + P_x(t, x(t)) f(t, x(t))$.


The result is


$g[P(t, x(t)), y[P(t, x(t))]]\,[P_t(t, x(t)) + P_x(t, x(t)) f(t, x(t))] = h_t(t, x(t)) + h_x(t, x(t)) f(t, x(t))$.


Finally we notice that, by the existence theorem for the system (1.1), there is a solution through every point of the space where $f$ is defined. Hence the last identity in $t$, based on an arbitrary solution $x(t)$ of (1.1), becomes an identity in $t$ and $x$, if we eliminate $y$ with the help of (2.2) and then replace $x(t)$ simply by $x$.


THEOREM 2.2. If $P(t, x)$ and $f(t, x)$ have the property that


(2.4)    $P_t(t, x) + P_x(t, x) f(t, x) \ne 0$


at every point of their common domain of definition and if $h(t, x)$ is any vector function satisfying (2.3), then (1.4) transforms (1.1) into (2.1).

Proof. Let $x(t)$ be any solution of (1.1). Then by (2.4) we have

$(d/dt)[P(t, x(t))] = P_t(t, x(t)) + P_x(t, x(t)) f(t, x(t)) \ne 0$.

Hence by the implicit function theorem, the equation

(2.5)    $s = P(t, x(t))$


may be solved for $t$ in terms of $s$. It is then easy to see from Definition 2.1 that it will be enough to prove that the vector function $y(s)$ defined as being the same as $h[t(s), x(t(s))]$ is a solution of (2.1). Remembering that $y(s(t)) = h(t, x(t))$, we see from (2.5) and (1.1) that

$dy(s(t))/ds(t) = (dy/dt)/(ds/dt) = [h_t(t, x) + h_x(t, x) f(t, x)]\,[P_t(t, x) + P_x(t, x) f(t, x)]^{-1}$,

where we have, of course, used $x$ as an abbreviation for $x(t)$. We thus obtain from (2.3) the result that


$y'(s(t)) = g[P(t, x), h(t, x)] = g[s(t), y(s(t))]$.


Changing the independent variable from $t$ to $s$, we obtain the desired result.
Before passing to the next theorem, it is convenient to introduce the following further notation: Let


(2.6)    $x = \xi(t, t_0, x_0)$

be the solution of (1.1) such that

(2.7)    $\xi(t_0, t_0, x_0) = x_0$.


Let $x = \xi(t, t_0, x_0)$ together with $u = U(t, t_0, x_0, u_0)$ be the solution of the following enlarged system of order $2n$:


(2.8a)    $dx/dt = f(t, x)$,
(2.8b)    $du/dt = G(t, x, u)$,


where $G(t, x, h)$ is an abbreviation for $g[P(t, x), h]\,[P_t(t, x) + P_x(t, x) f(t, x)]$, so that consequently the partial differential equation (2.3) appears in the abbreviated form


(2.9)    $h_t(t, x) + h_x(t, x) f(t, x) = G[t, x, h(t, x)]$.

The function $U(t, t_0, x_0, u_0)$ is to satisfy the condition


(2.10)    $U(t_0, t_0, x_0, u_0) = u_0$.


The fact, that such functions $\xi$ and $U$, affording a solution of (2.8) and satisfying the initial conditions indicated by (2.7) and (2.10), actually exist, is, of course, a consequence of the known theory of ordinary differential equations.


THEOREM 2.3. The $n$-vector function

(2.11)    $h(t, x) = U[t, t_0, \xi(t_0, t, x), H(\xi(t_0, t, x))]$


satisfies the partial differential equation (2.9) (which is the same as (2.3)), together with the initial condition,


(2.12)    $h(t_0, x) = H(x)$.


Here it is understood that $H(x)$ is an arbitrary $n$-vector function of class $C'$.

The proof of this theorem may be left to the reader, since a theory for the equation (2.9) in which $h$ is a vector may be formulated exactly as if $h$ were a scalar. Such a theory in the scalar case is given by Kamke [3], vol. 2, pp. 40-42. It may also be remarked that the verification of (2.12) is immediate with the help of (2.7) and (2.10), and that the verification of (2.9), although somewhat more complicated, may be carried out in a straightforward manner by differentiation of (2.11) and by use of well known properties of the functions $\xi$ and $U$.
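
The "immediate" verification of (2.12) can be displayed in three lines. This is a sketch that reads (2.11) as $h(t, x) = U[t, t_0, \xi(t_0, t, x), H(\xi(t_0, t, x))]$, which is the form the initial conditions (2.7) and (2.10) appear to require; nothing beyond those two conditions is used.

```latex
% Assumed reading of (2.11): h(t,x) = U[t, t_0, xi(t_0,t,x), H(xi(t_0,t,x))].
\begin{align*}
h(t_0, x) &= U\bigl[t_0,\, t_0,\, \xi(t_0, t_0, x),\, H(\xi(t_0, t_0, x))\bigr]
  && \text{(set } t = t_0 \text{ in (2.11))}\\
&= U\bigl[t_0,\, t_0,\, x,\, H(x)\bigr]
  && \text{(by (2.7), } \xi(t_0, t_0, x) = x\text{)}\\
&= H(x)
  && \text{(by (2.10))}.
\end{align*}
```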


3. The transformation of a system of differential equations into itself. We continue the discussion initiated in the preceding section only for the special case in which


(3.1)    $g(t, x) = f(t, x)$.


We also shall assume throughout the rest of the paper that the inequality (2.4) holds. With this understanding the following theorem is an immediate corollary of Theorems 2.1 and 2.2.


THEOREM 3.1. A necessary and sufficient condition that (1.4) transform (1.1) into itself is that


(3.2)    $h_t(t, x) + h_x(t, x) f(t, x) = f[P(t, x), h(t, x)]\,[P_t(t, x) + P_x(t, x) f(t, x)]$.
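
A concrete check of condition (3.2) may help fix ideas. The example below is ours, not the author's: it assumes the scalar equation $dx/dt = x^2$ (so $n = 1$) and the scaling transformation $s = t/c$, $y = cx$ with a nonzero constant $c$, and reads (3.2) as $h_t + h_x f = f[P, h]\,[P_t + P_x f]$.

```latex
% Hypothetical illustration: f(t,x) = x^2, P(t,x) = t/c, h(t,x) = c x, c a nonzero constant.
\begin{align*}
h_t + h_x f &= 0 + c \cdot x^2 = c\,x^2,\\
f[P, h]\,[P_t + P_x f] &= (c x)^2 \Bigl[\tfrac{1}{c} + 0 \cdot x^2\Bigr] = c\,x^2,\\
P_t + P_x f &= \tfrac{1}{c} \ne 0 \qquad \text{(so (2.4) holds)}.
\end{align*}
% Direct confirmation on explicit solutions: x(t) = 1/(a - t) solves dx/dt = x^2, and
% y(s) = c\,x(cs) = c/(a - cs) = 1/(a/c - s) is again a solution of the same equation.
```

Both sides of (3.2) agree, so under these assumptions the scaling carries the equation into itself, and the solution $1/(a - t)$ is synartetic to $1/(a/c - t)$.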


Similarly specializing Theorem 2.3 by (3.1), and using Theorem 3.1, we may state the obvious

THEOREM 3.2. When $P(t, x)$ and $f(t, x)$ are given, $h$ may be found in infinitely many ways in such a manner that (1.4) transforms (1.1) into itself. In fact we may assign $h(t_0, x)$ arbitrarily for any fixed $t_0$.


DEFINITION 3.1. If (1.4) transforms (1.1) into itself, and if $x(t)$ is any solution of (1.1), the solution $y(t)$ of (1.1), (possibly not distinct from $x(t)$), which is such that


(3.3)    $y[P(t, x(t))] = h(t, x(t))$,


is said to be synartetic to $x(t)$ under the transformation (1.4). If $y(t) \equiv x(t)$, we shall term $x(t)$ an autosynartetic solution of (1.1) under (1.4).


Notice that, under the assumptions of this definition, the existence of 
the y(t) mentioned above is guaranteed by Definition 2.1. 


THEOREM 3.3. Assuming that (1.4) transforms (1.1) into itself, a necessary and sufficient condition that a solution $y(t)$ be synartetic to a solution $x(t)$ of (1.1) under (1.4) is that


(3.4)    $y[P(t_0, x(t_0))] = h(t_0, x(t_0))$

for any fixed $t_0$.


Proof. The necessity of (3.4) is obvious, since we may substitute $t_0$ for $t$ in (3.3).

The sufficiency follows by an easy argument based, firstly, on the existence of a solution synartetic to a given solution, $x(t)$, as guaranteed above, and, secondly, on the uniqueness theorem for equation (1.1).

By considering the special case $y \equiv x$, we get the following obvious corollary to Theorem 3.3, to which we must frequently refer in the later sections of this paper.


THEOREM 3.4. Assuming that (1.4) transforms (1.1) into itself, a necessary and sufficient condition that a solution $x(t)$ of (1.1) should be autosynartetic under (1.4) is that


(3.5)    $x[P(t_0, x(t_0))] = h(t_0, x(t_0))$

for any fixed $t_0$.
In the next section we tacitly assume the validity of 


THEOREM 3.5. Let $x(t, \alpha)$ be a family of solutions of (1.1) which is continuous and continuously differentiable with respect to the parameter $\alpha$. Let $y(t, \alpha)$ be the family of solutions of (1.1) synartetic under (1.4) to $x(t, \alpha)$. Then $y(t, \alpha)$ is also continuous and continuously differentiable.


Proof. We may, because of (2.4), solve the equation, $s = P(t, x(t, \alpha))$ for $t = t(s, \alpha)$. Hence the solution $y(t, \alpha)$, synartetic to $x(t, \alpha)$, is given by the formula, $y(s, \alpha) = h[t(s, \alpha), x(t(s, \alpha), \alpha)]$. If $h$, $P$, and $x$ are of class $C'$, so is $t(s, \alpha)$, and hence eventually $y(s, \alpha)$.


4. Properties of the variational system and of its adjoint. Let us consider an arbitrary solution $x = \phi(t)$ of (1.1). The equations of variation based on this solution are


(4.1)    $d\xi/dt = A(t)\xi$,

where

(4.2)    $A(t) = f_x(t, \phi(t))$.


The equations of variation have the following familiar fundamental property: If $x = x(t, \alpha)$ is a family of solutions of (1.1) depending on a parameter $\alpha$ in any manner and such that $x(t, 0) = \phi(t)$, then $\xi(t) = \partial x(t, 0)/\partial\alpha$ is a solution of (4.1).
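
The fundamental property just stated follows from one differentiation. The sketch below assumes, beyond what the text states, that $x(t, \alpha)$ is smooth enough in $(t, \alpha)$ for the mixed partial derivatives to exist and be interchanged; the symbols $\xi$, $\phi$, $A$ are those of (4.1) and (4.2).

```latex
% Assumption: x(t, alpha) has continuous mixed second partials, so d/dt and d/dalpha commute.
\begin{align*}
\frac{\partial}{\partial t}\, x(t, \alpha) &= f\bigl(t, x(t, \alpha)\bigr)
  && \text{(each member of the family solves (1.1))},\\
\frac{\partial}{\partial t}\, \frac{\partial x(t, \alpha)}{\partial \alpha}
  &= f_x\bigl(t, x(t, \alpha)\bigr)\, \frac{\partial x(t, \alpha)}{\partial \alpha}
  && \text{(differentiate with respect to } \alpha\text{)},\\
\frac{d\xi}{dt} &= f_x\bigl(t, \phi(t)\bigr)\, \xi = A(t)\,\xi
  && \text{(set } \alpha = 0,\ \xi(t) = \partial x(t, 0)/\partial\alpha\text{)}.
\end{align*}
```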

Now let us suppose that (1.4) transforms (1.1) into itself. Let us denote the synartetic solution of $x(t, \alpha)$ by $y(t, \alpha)$. Then $\eta(t) = \partial y(t, 0)/\partial\alpha$ will be a solution of the system $d\eta/dt = A^*(t)\eta$ where $A^*(t) = f_x(t, \phi^*(t))$, in which $\phi^*(t) = y(t, 0)$ is of course synartetic to $x(t, 0) = \phi(t)$. Consequently, if $\phi(t)$ is autosynartetic, we shall have $\phi^*(t) = \phi(t)$ and $A^*(t) = A(t)$; so that $\eta$ satisfies the same system (4.1) as $\xi$.

We now proceed to compute the relationship between $\xi$ and $\eta$. Since $y(t, \alpha)$ is synartetic to $x(t, \alpha)$, we know that $y(s, \alpha)$ is obtained from the equations,

(4.3)    $y = h(t, x(t, \alpha))$
(4.4)    $s = P(t, x(t, \alpha))$


by the elimination of $t$. Hence

(4.5)    $\delta y/\delta\alpha = \partial y/\partial\alpha + (\partial y/\partial t)(\delta t/\delta\alpha)$


where partial derivatives obtained under the assumption that $\alpha$ and $t$ are independent variables are denoted by $\partial$, while those obtained under the assumption that $\alpha$ and $s$ are independent variables are denoted by $\delta$. From (4.3) we obtain


(4.6)    $\partial y/\partial\alpha = h_x(t, x(t, \alpha))(\partial x/\partial\alpha)$ and $\partial y/\partial t = h_t(t, x(t, \alpha)) + h_x(t, x(t, \alpha)) f(t, x(t, \alpha))$,


where we have simplified the last expression by using the fact that $x(t, \alpha)$ is a solution of (1.1). We further transform the last expression with the help of (3.2), thus obtaining


(4.7)    $\partial y/\partial t = f[P(t, x(t, \alpha)), h(t, x(t, \alpha))]\,[P_t(t, x(t, \alpha)) + P_x(t, x(t, \alpha)) f(t, x(t, \alpha))]$.

We next compute $\delta t/\delta\alpha$ from (4.4) as follows:

$0 = \delta s/\delta\alpha = [P_t(t, x(t, \alpha)) + P_x(t, x(t, \alpha)) f(t, x(t, \alpha))](\delta t/\delta\alpha) + P_x(t, x(t, \alpha))(\partial x/\partial\alpha)$.

Hence

$\delta t/\delta\alpha = -P_x(t, x(t, \alpha))(\partial x/\partial\alpha)\,[P_t(t, x(t, \alpha)) + P_x(t, x(t, \alpha)) f(t, x(t, \alpha))]^{-1}$.

Using this last result together with (4.6) and (4.7) we see from (4.5) that


$\delta y/\delta\alpha = [h_x(t, x(t, \alpha)) - f[P(t, x(t, \alpha)), h(t, x(t, \alpha))] \circ P_x(t, x(t, \alpha))](\partial x/\partial\alpha)$.


Here, and throughout the remainder of the paper, $f \circ P_x$ denotes the matrix of order $n$ the element in whose $i$-th row and $j$-th column is the product of the $i$-th component of $f$ by the $j$-th component of $P_x$. This is to be contrasted with the scalar product $P_x f$ used also very frequently in this paper.
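
The substitution that produces the last display can be made explicit. In the sketch below the arguments $(t, x(t, \alpha))$ are suppressed, $x_\alpha$ abbreviates $\partial x/\partial\alpha$, and $f[P, h]$ means $f$ evaluated at $(P, h)$; these abbreviations are ours. The only fact used about the matrix $f \circ P_x$ is that it acts on a vector $v$ by $(f \circ P_x)\,v = f\,(P_x v)$.

```latex
% Notation (ours): x_alpha = dx/dalpha; (f o P_x)_{ij} = f_i * dP/dx_j, so (f o P_x) v = f (P_x v).
\begin{align*}
\frac{\delta y}{\delta \alpha}
  &= h_x\, x_\alpha + \frac{\partial y}{\partial t}\,\frac{\delta t}{\delta \alpha}
  && \text{(by (4.5) and (4.6))}\\
  &= h_x\, x_\alpha
     - f[P, h]\,\bigl[P_t + P_x f\bigr]\,
       \frac{P_x\, x_\alpha}{P_t + P_x f}
  && \text{(by (4.7) and the formula for } \delta t/\delta\alpha\text{)}\\
  &= h_x\, x_\alpha - f[P, h]\,\bigl(P_x\, x_\alpha\bigr)
  && \text{(the scalar } P_t + P_x f \ne 0 \text{ cancels)}\\
  &= \bigl[h_x - f[P, h] \circ P_x\bigr]\, x_\alpha .
\end{align*}
```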

Setting $\alpha = 0$, $x(t, 0) = \phi(t)$, $\delta y(s, 0)/\delta\alpha = \eta(s)$ and $\partial x(t, 0)/\partial\alpha = \xi(t)$, we obtain


(4.8)    $\eta(s) = B(t)\xi(t)$

where the matrix $B(t)$ is defined by


(4.9)    $B(t) = h_x(t, \phi(t)) - f[P(t, \phi(t)), h(t, \phi(t))] \circ P_x(t, \phi(t))$,


or, since $\phi(t)$ is autosynartetic, by


(4.9 alt.)    $B(t) = h_x(t, \phi(t)) - f[P(t, \phi(t)), \phi(P(t, \phi(t)))] \circ P_x(t, \phi(t))$,

and where the $s$ of (4.8) is related to the $t$ by means of (4.4) with $\alpha = 0$, in other words by

(4.10)    $s = P(t, \phi(t)) = p(t)$.

We interpret and summarize the main results of this discussion in the 
following : 


THEOREM 4.1. The variational system of (1.1) based on an autosynartetic solution $\phi(t)$ of (1.1) under (1.4) is transformed into itself by means of the transformation


(4.11)    $s = p(t), \quad \eta = B(t)\xi$


where the scalar function $p(t)$ and the matrix function $B(t)$ are given respectively by (4.10) and by (4.9).

Proof. According to Definition 2.1, we need only to show that to every arbitrary solution $\xi(t)$ of (4.1) there corresponds a solution $\eta(t)$ of (4.1) such that $\eta(p(t)) = B(t)\xi(t)$, or, in other words, such that (4.8) holds with $s$ given by (4.10). Our previous discussion therefore yields a complete proof, as soon as it is established that an arbitrary solution $\xi(t)$ can be represented in the form $\partial x(t, 0)/\partial\alpha$ used above. This is an elementary detail which we leave to the reader.

An almost immediate corollary of Theorem 4.1 is 


THEOREM 4.2. Between the matrices $A(t)$, $B(t)$, and the scalar $p(t)$ there is the following relationship:


(4.12)    $dB(t)/dt + B(t)A(t) - A(p(t))B(t)(dp(t)/dt) = 0$.


Proof. Since (4.11) transforms (4.1) into itself we may apply the partial differential equation (3.2), with appropriate changes of notation, to the present linear situation. In this way we obtain the result that $(dB(t)/dt)\xi + B(t)A(t)\xi = A(p(t))B(t)\xi\,(dp(t)/dt)$. But, since this is an identity in the vector $\xi$, we obtain (4.12) at once.
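
As a sanity check, (4.12) can be evaluated in the periodic case of the Introduction. The following sketch assumes $P(t, x) = t + T$ and $h(t, x) = x$, with $f$ periodic in $t$ of period $T$ and $\phi$ a periodic solution; $I$ denotes the identity matrix. It is an illustration of ours and not part of the original proof.

```latex
% Assumed special case: P = t + T, h = x, so P_x = 0 and h_x = I.
\begin{align*}
B(t) &= h_x - f \circ P_x = I - f \circ 0 = I, \qquad p(t) = t + T, \qquad dp/dt = 1,\\
\frac{dB}{dt} + B(t)A(t) - A(p(t))\,B(t)\,\frac{dp}{dt}
  &= 0 + A(t) - A(t + T),\\
A(t + T) &= f_x\bigl(t + T, \phi(t + T)\bigr) = f_x\bigl(t, \phi(t)\bigr) = A(t).
\end{align*}
```

So in this case (4.12) reduces to the statement that the coefficient matrix of the variational system is periodic with period $T$.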

It is also possible to verify (4.12) directly from the definitions of $A(t)$, $B(t)$, and $p(t)$ and from the fact that $h$ satisfies (3.2). But the calculation is rather tedious. From such an alternative proof of Theorem 4.2, we also have an alternative proof for Theorem 4.1, which follows from Theorem 4.2 with the help of Theorem 3.1.

At this stage, it is convenient to introduce the function $q(t)$ which is the inverse of $p(t)$, so that

(4.13)    $p(q(t)) = q(p(t)) = t$.

This is legitimate, since by (4.10) and (2.4),

$dp(t)/dt = P_t(t, \phi(t)) + P_x(t, \phi(t)) f(t, \phi(t))$

is never zero. We also introduce the matrix $C(t)$ defined by

(4.14)    $C(t) = B(q(t))'$.


The accent is used here and in the remainder of the paper to denote the transpose of a matrix.


THEOREM 4.3. The linear system

(4.15)    $d\xi/dt = -A(t)'\xi$ or $d\xi/dt = -\xi A(t)$,

which is the so-called adjoint to the system (4.1), is transformed into itself by means of the transformation,

(4.16)    $s = q(t), \quad \eta = C(t)\xi$.

Proof. From Theorem 3.1, as in the proof of the previous theorem, we find that a necessary and sufficient condition that (4.15) be transformed into itself by (4.16) is the validity of the identity

(4.17)    $dC(t)/dt - C(t)A(t)' + A(q(t))'C(t)(dq(t)/dt) = 0$.

If we replace $t$ by $p(t)$ and then multiply by $dp/dt$ we get the equivalent identity,

$[dC(p(t))/dp](dp(t)/dt) - C(p(t))A(p(t))'(dp(t)/dt) + A(q(p(t)))'C(p(t))(dq(p(t))/dp)(dp(t)/dt) = 0$.

This, with the help of (4.13) and (4.14), is seen to be equivalent to

$dB(t)'/dt - B(t)'A(p(t))'(dp(t)/dt) + A(t)'B(t)' = 0$.


But this last identity is merely the transpose of (4.12). Thus we have shown that (4.17) is equivalent to the already established identity (4.12).


THEOREM 4.4. In order that a solution $\xi(t)$ of the variational equations (4.1) be autosynartetic under the transformation (4.11) it is necessary and sufficient that it satisfy the "boundary conditions,"


(4.18)    $\xi(T) = B\xi(0)$,

where

(4.19)    $T = p(0)$ and $B = B(0)$.


Proof. This is a special case of Theorem 3.4 applied to the linear system under (4.11) instead of to the general system (1.1) under (1.4). And, in making this application, we take $t_0 = 0$.


THEOREM 4.5. In order that a solution $\xi(t)$ of the adjoint system (4.15) be autosynartetic under the transformation (4.16) it is necessary and sufficient that it satisfy the boundary conditions


(4.20)    $\xi(T)B = \xi(0)$


where $T$ and $B$ are given by (4.19).


Proof. Again appealing to Theorem 3.4, we find that a necessary and sufficient condition that $\xi(t)$ be autosynartetic under (4.16) is that $\xi[q(T)] = C(T)\xi(T)$. In obtaining this result we take the $t_0$ of Theorem 3.4 to be $T$. From (4.13) and (4.14), we find that $q(T) = 0$ and that $C(T) = B'$, whence it appears that (4.20) is an equivalent condition.

Whenever we refer to autosynartetic solutions of the variational system 
or its adjoint, we always mean autosynarteticity under (4.11), in the case 
of the variational system, and under (4.16) in the case of the adjoint system. 
Thus, according to Theorems 4.4 and 4.5, autosynartetic solutions of the 
variational equations are those which satisfy the boundary conditions (4.18) 
and autosynartetic solutions of the adjoint equations are those which satisfy 
the boundary conditions (4.20). With this understanding we have 


THEOREM 4.6. The maximum number of solutions in any set of linearly independent autosynartetic solutions of the variational equations (4.1) is equal to the maximum number of solutions in any set of linearly independent autosynartetic solutions of the adjoint system (4.15).


Proof. Let the $n \times n$ matrix $X(t)$ satisfy $dX/dt = A(t)X$ and $\det X(0) \ne 0$. Hence $\det X(t) \ne 0$. Corresponding to any arbitrary solution $\xi(t)$ of the variational equations, there exists an $n$-vector $c$ such that $\xi(t) = X(t)c$, and, by Theorem 4.4, this is autosynartetic under (4.11) if and only if $X(T)c = BX(0)c$, which is equivalent to $X(T)^{-1}[X(T) - BX(0)]c = 0$. Hence the number of linearly independent autosynartetic solutions of the variational equations is equal to the number of linearly independent relationships between the columns of the matrix $X(T)^{-1}[X(T) - BX(0)]$.

Now, if we set $Y(t)' = X(t)^{-1}$, it is well known (and indeed easy to see by differentiation of $Y(t)'X(t) = I$) that $dY/dt = -A(t)'Y$ and $\det Y(t) \ne 0$. Hence, corresponding to any arbitrary solution $\xi_1(t)$ of the adjoint system, there exists an $n$-vector $c_1$ such that $\xi_1(t) = Y(t)c_1$, and, by Theorem 4.5, this is autosynartetic under (4.16) if and only if $[Y(T)c_1]B = Y(0)c_1$ or $c_1Y(T)'B = c_1Y(0)'$. In terms of $X(t)$ this condition appears (after some simple manipulations) in the equivalent form $c_1X(T)^{-1}[X(T) - BX(0)] = 0$. Hence the number of linearly independent autosynartetic solutions of the adjoint system is equal to the number of linearly independent relationships between the rows of the matrix $X(T)^{-1}[X(T) - BX(0)]$.

The theorem follows at once from the above two italicized sentences.
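
The final step rests on the equality of row rank and column rank. A short sketch, in which the symbol $M$ is an abbreviation of ours for the matrix appearing in both counts:

```latex
% M is our shorthand; X, B, T are as in the proof above.
\begin{align*}
M &= X(T)^{-1}\bigl[X(T) - BX(0)\bigr] \qquad (n \times n),\\
\dim\{\, c : Mc = 0 \,\} &= n - \operatorname{rank} M
  && \text{(relations between the columns of } M\text{)},\\
\dim\{\, c_1 : c_1 M = 0 \,\} &= n - \operatorname{rank} M
  && \text{(relations between the rows of } M\text{)}.
\end{align*}
```

Both counts therefore equal $n - \operatorname{rank} M$, which is the common number asserted in Theorem 4.6.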


5. Synartetic first integrals and so-called degeneracy. First integrals are defined in the usual way; namely a scalar function $J(t, x)$ is called a first integral of (1.1), if its value is independent of $t$ whenever $x$ is replaced by any solution $x(t)$ of (1.1). It is both obvious and well known that a necessary and sufficient condition that $J$ be a first integral is that


(5.1)    $J_t(t, x) + J_x(t, x) f(t, x) = 0$,

at least, if $J$ is of class $C'$.


If (1.1) is transformed into itself by (1.4), then with every solution $x(t)$ of (1.1) there is associated a synartetic solution $y(t)$. Hence, if $J(t, x)$ is a first integral of (1.1), we see that $J(s, y(s))$ must be independent of $s$. Hence, setting $s = P(t, x(t))$ and remembering that then $y(s) = h(t, x(t))$ by definition of a synartetic solution, we see that $J[P(t, x(t)), h(t, x(t))]$ must be independent of $t$ no matter with what solution $x(t)$ we may be dealing. In other words, we have established


THEOREM 5.1. If $J(t, x)$ is a first integral of (1.1) and if (1.1) is transformed into itself by (1.4), then $J[P(t, x), h(t, x)]$ is also a first integral of (1.1).

DEFINITION 5.1. Under the assumptions of Theorem 5.1, the first integral $J[P(t, x), h(t, x)]$ is said to be synartetic to the first integral $J(t, x)$; and, if it happens that

(5.2)    $J[P(t, x), h(t, x)] \equiv J(t, x)$,

we shall say that the first integral $J(t, x)$ is autosynartetic under (1.4).


THEOREM 5.2. Let $\phi(t)$ be an autosynartetic solution of (1.1) under (1.4). Let $J(t, x)$ be an autosynartetic first integral of (1.1) also under (1.4). Then $\xi(t) = J_x(t, \phi(t))$ is an autosynartetic solution of the adjoint system (4.15) of the equations of variation (4.1).


Proof. Differentiating (5.2) with respect to $x$ and then setting $x = \phi(t)$ and $p(t) = P(t, \phi(t))$, we find that


$J_t[p(t), h(t, \phi(t))]\,P_x(t, \phi(t)) + J_x[p(t), h(t, \phi(t))]\,h_x(t, \phi(t)) = J_x(t, \phi(t))$.

But according to (5.1) we know that

$J_t[p(t), h(t, \phi(t))] = -J_x[p(t), h(t, \phi(t))]\,f[p(t), h(t, \phi(t))]$.


Hence, eliminating $J_t[p(t), h(t, \phi(t))]$ and remembering that $h(t, \phi(t)) = \phi(p(t))$, since $\phi$ is autosynartetic, we find that


$J_x[p(t), \phi(p(t))]\,[h_x(t, \phi(t)) - f[p(t), h(t, \phi(t))] \circ P_x(t, \phi(t))] = J_x(t, \phi(t))$.


Thus, remembering that $p(t) = P(t, \phi(t))$ and $\xi(t) = J_x(t, \phi(t))$, we find from (4.9) that $\xi(p(t))B(t) = \xi(t)$. Setting $t = 0$, we find from (4.19) that $\xi(t)$ satisfies the boundary condition (4.20). It remains to show that $\xi(t)$ satisfies the adjoint system (4.15). Since this may be done as in the periodic special case, we leave this part of the proof to the reader. See Lewis [4], p. 542, lines 23-37, where, however, the notation as well as the context must be modified to fit present circumstances.
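
For the part of the proof left to the reader, one possible route is sketched below in components. It assumes, beyond the stated hypotheses, that $J$ is of class $C''$ so that its second partial derivatives exist and are symmetric; it is not claimed to be the argument of [4]. Here $\xi_j(t) = J_{x_j}(t, \phi(t))$ and $A_{ij}(t) = \partial f_i/\partial x_j$ evaluated at $(t, \phi(t))$.

```latex
% Assumption (ours): J has continuous second partial derivatives.
\begin{align*}
0 &= \frac{\partial}{\partial x_j}\Bigl[J_t + \sum_i J_{x_i} f_i\Bigr]
   = J_{t x_j} + \sum_i J_{x_i x_j}\, f_i + \sum_i J_{x_i}\, \frac{\partial f_i}{\partial x_j}
  && \text{(differentiate (5.1))},\\
\frac{d\xi_j}{dt} &= \frac{d}{dt}\, J_{x_j}\bigl(t, \phi(t)\bigr)
   = J_{x_j t} + \sum_i J_{x_j x_i}\, f_i
  && \text{(chain rule, } d\phi/dt = f\text{)},\\
\frac{d\xi_j}{dt} &= -\sum_i J_{x_i}\, \frac{\partial f_i}{\partial x_j}
   = -\sum_i \xi_i\, A_{ij}(t)
  && \text{(combine, all terms at } x = \phi(t)\text{)}.
\end{align*}
```

The last line is the row-vector form $d\xi/dt = -\xi A(t)$ of the adjoint system (4.15).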


DEFINITION 5.2. The degeneracy of an autosynartetic solution $\phi(t)$ of the system (1.1) is equal to the maximum number of solutions of the variational equations (4.1) in any set of linearly independent autosynartetic solutions of these equations or, what by Theorem 4.6 is the same thing, the maximum number of solutions of the adjoint equations (4.15) in any set of linearly independent autosynartetic solutions of the adjoint equations.


If there are no autosynartetic solutions of (4.1) other than the trivial solution $\xi \equiv 0$, the degeneracy is of course 0 by the definition, and we then sometimes say that $\phi(t)$ is non-degenerate.


THEOREM 5.3. If the system (1.1) admits $k$ "independent" autosynartetic first integrals, then the degeneracy of any autosynartetic solution $\phi(t)$ of (1.1) is at least $k$.


Here the hypothesis that the $k$ first integrals should be independent is interpreted as meaning that the rank of the jacobian matrix of the $k$ integrals with respect to the components of $x$ should be $k$ when $x = \phi(t)$. This insures the existence of $k$ linearly independent autosynartetic solutions of the adjoint system as indicated in Theorem 5.2, so that no further proof is needed for Theorem 5.3.


THEOREM 5.4. If (1.1) admits a k-parameter family of autosynartetic 
solutions, the degeneracy of any one of the solutions imbedded in the family 
is at least k. 


Proof. Suppose the k-parameter family of autosynartetic solutions of 
(1.1) is represented by x =<«(t,c), where c is a k-vector and x(t,0) —¢(t). 
Then the & columns of the nzk matrix x,(t,0) are obviously solutions of the 
variational equations (4.1) based on $(t). Moreover, since r(t,c) for each 
fixed c is given as autosynartetic under (1.4), we have from Theorem 3.4 


z[P(0, c)), c] = h(0, c)). 


DIFFERENTIAL EQUATIONS. 15 


Differentiating with respect to c and then setting c—0O and also using the 
fact that z(t,c) is a solution of (1.1) we obtain 


fLP(O, $(0)), $(0))]]°P2(0, $(0))x-(0, 0) + reLP(0, $(0)), 0] 
(0) 0), 
which, because of (4.9 alt.), (4.10), and (4.19), can be written 
== Bz,(0,0). 


Hence x_c(t, 0) satisfies the boundary conditions specified in Theorem 4.4.
The fact that x(t, c) is actually a k-parameter family means also that the
rank of the matrix x_c(t, c) is k. Thus, the k columns of x_c(t, 0) yield k
linearly independent autosynartetic solutions of the variational equations, so
that, by Definition 5.2, the degeneracy of φ(t) must be at least k.
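
The argument of Theorem 5.4 can be illustrated numerically in the periodic special case. The sketch below is only an illustration under stated assumptions: the system x1' = x2, x2' = -x1, the family x(t, c) = (sin(t + c), cos(t + c)), the period T = 2π and B = I are all chosen here for convenience and do not come from the paper. It checks that a finite-difference approximation of x_c(t, 0) agrees with X(t)x_c(0, 0), so that it solves the variational equations, and that it satisfies the boundary condition Bξ(0) = ξ(T).

```python
import math

def family(t, c):
    """Hypothetical one-parameter family x(t, c) of 2*pi-periodic solutions
    of x1' = x2, x2' = -x1, with base solution x(t, 0) = (sin t, cos t)."""
    return (math.sin(t + c), math.cos(t + c))

def x_c(t, eps=1e-6):
    """Central-difference approximation of the column x_c(t, 0)."""
    plus, minus = family(t, eps), family(t, -eps)
    return tuple((p - m) / (2 * eps) for p, m in zip(plus, minus))

def fundamental(t):
    """Fundamental matrix X(t) with X(0) = I for A = [[0, 1], [-1, 0]].
    The example system is linear, so the variational matrix A is constant."""
    return ((math.cos(t), math.sin(t)), (-math.sin(t), math.cos(t)))

def apply(matrix, vector):
    """Matrix-vector product for small tuples."""
    return tuple(sum(a * v for a, v in zip(row, vector)) for row in matrix)

T = 2 * math.pi
xi0 = x_c(0.0)

# x_c(t, 0) should coincide with X(t) x_c(0, 0), i.e. solve the variational equations.
variational_gap = max(
    abs(a - b)
    for t in (0.3, 1.7, 4.0)
    for a, b in zip(x_c(t), apply(fundamental(t), xi0))
)

# Periodic boundary condition B xi(0) = xi(T) with B = I.
boundary_gap = max(abs(a - b) for a, b in zip(x_c(T), xi0))

print(variational_gap, boundary_gap)
```

Both printed gaps should be at the level of finite-difference rounding error, and x_c(0, 0) is not the zero vector, so this family contributes one non-trivial solution of the boundary problem, in line with a degeneracy of at least 1.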


THEOREM 5.5. Assume that the set of all transformations which trans- 
form (1.1) into itself contains a continuous family F of transformations of 
class C’ satisfying the following three conditions: 


P1. Every transformation of F commutes with (1.4).
P2. F contains the identity transformation.


P3. There is a solution φ(t) which is autosynartetic under (1.4) but
which is not autosynartetic under any transformation of F close to the identity
(except under the identity itself).


Then φ(t) has degeneracy ≥ 1.


Proof. A transformation S which transforms (1.1) into itself according
to Definition 2.1 provides a mapping of the set of all solutions of (1.1) into
itself. Namely S maps an arbitrary solution x of (1.1) onto the solution y
which is synartetic to x under S. We express this mapping by writing Sx = y.
Let S* denote the transformation (1.4) and let S_λ denote a transformation
of F, where λ represents a set of values for the parameters of F. We may
suppose by P2 that S_0 is the identity transformation.

By P3, we have S*φ = φ. Hence S_λS*φ = S_λφ. Whence, from P1, we
obtain S*S_λφ = S_λφ. Hence S_λφ is autosynartetic under S*. Hence by P2,
φ = S_0φ is imbedded in a continuous family of solutions {S_λφ} each of which
is autosynartetic under S*, i.e. under (1.4). Because of P3, S_λφ ≠ φ for λ
close to 0, except for λ = 0. Hence the number of essential parameters in our
family of autosynartetic solutions is surely greater than zero. Hence by
Theorem 5.4 the degeneracy of φ is greater than zero, as we wished to prove.




In the above theorem the family F does not necessarily form a group.
In fact, our Definition 2.1 does not require the transformations which
“transform (1.1) into itself” to have inverses. Indeed, if {x} is the set of all
solutions x of (1.1) and if S is a transformation transforming (1.1) into
itself, the set {Sx} of all synartetic solutions under S can be a proper subset
of {x}.

If, however, the transformation S* (i.e. (1.4)) is an element of a full 
differentiable continuous group G of transformations taking (1.1) into itself, 
we know from the theory of Lie that G possesses a one-parameter continuous 
abelian subgroup F also containing S*. Hence the conditions P1 and P2 are 
automatically satisfied in this case. This is the situation, mentioned in 
Section 1, which occurs in the classical theory of periodic solutions of auto- 
nomous systems. 


6. Perturbation of autosynartetic solutions. We now suppose that the
system (1.1) contains a parameter μ and that the same may also be true of
the transformation (1.4). We therefore write


(6.1) dx/dt = f(t, x, μ)
instead of (1.1), and
(6.2) s = P(t, x, μ), y = h(t, x, μ)


instead of (1.4). We suppose that the dependence of f, P, and h on μ is of
class C’ and that (6.1) is transformed into itself by the transformation (6.2)
for each value of μ such that |μ| < β.


THEOREM 6.1. Let φ(t) be a nondegenerate autosynartetic solution of
(6.1) under the transformation (6.2) when μ = 0. Then there exists a
positive number β* ≤ β, such that (6.1), for |μ| < β*, admits an
autosynartetic solution x(t, μ) under the transformation (6.2). Moreover the
dependence of x(t, μ) on μ is of class C’ and x(t, μ) → φ(t) as μ → 0, and
x(t, μ) is the only autosynartetic solution of (6.1) in a suitably chosen
neighborhood of φ(t).


Proof. According to Theorem 3.4 (with t_0 = 0), a necessary and
sufficient condition that a solution x(t, μ) be autosynartetic under (6.2)
is that


(6.3) x[P(0, x_0, μ), μ] − h(0, x_0, μ) = 0,




where x_0 = x_0(μ) = x(0, μ). Suppose ψ(t, x_0, μ) is the solution of (6.1) such
that ψ(0, x_0, μ) = x_0. Then (6.3) may be rewritten


(6.4) ψ[P(0, x_0, μ), x_0, μ] − h(0, x_0, μ) = 0,


which we wish to solve for x_0 as a function of μ for |μ| sufficiently small,
already knowing, of course, that (6.4) is satisfied by x_0 = φ(0) when μ = 0.
A straightforward calculation of the Jacobian J of the left member of (6.4)
with respect to x_0 at μ = 0, x_0 = φ(0), leads, with the help of (4.19), (4.10)
and (4.9 alt.) to the result, J = X(T) − BX(0), where T = P(0, φ(0), 0)
= p(0) and where X(t) = ψ_{x_0}(t, φ(0), 0) is a matrix solution of the
variational equations (4.1), set up using f(t, x, 0) of (6.1) in place of the f(t, x)
of (1.1). If det J were zero, there would exist a vector c ≠ 0 such that
[X(T) − BX(0)]c = 0, and then according to Theorem 4.4, ξ(t) = X(t)c
would be a non-trivial autosynartetic solution of the variational equations,
contrary to the hypothesis that φ(t) is non-degenerate. Thus det J ≠ 0, and
applying the implicit function theorem, we write x(t, μ) = ψ(t, x_0(μ), μ).
This x(t, μ) is easily seen to have the properties stated in the theorem.

Most of the details in this proof are left to the reader because of the 
similarity to the well known periodic special case. 
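
In the periodic special case the construction in this proof amounts to solving ψ(T, x_0, μ) − x_0 = 0 for the initial value x_0. The following minimal sketch does this numerically; it is an illustration only. The scalar right-hand side f_example, the period 2π, the Runge-Kutta integrator and the finite-difference Newton step are assumptions made here for the example and are not part of the paper. For μ = 0 the base solution is φ(t) = 0, and the variational equation dξ/dt = −ξ has no non-trivial periodic solution, so φ is non-degenerate.

```python
import math

def flow(f, x0, mu, t_end, steps=2000):
    """Classical RK4 approximation of psi(t_end, x0, mu) for dx/dt = f(t, x, mu)."""
    h = t_end / steps
    t, x = 0.0, x0
    for _ in range(steps):
        k1 = f(t, x, mu)
        k2 = f(t + h / 2, x + h * k1 / 2, mu)
        k3 = f(t + h / 2, x + h * k2 / 2, mu)
        k4 = f(t + h, x + h * k3, mu)
        x += h * (k1 + 2 * k2 + 2 * k3 + k4) / 6
        t += h
    return x

def solve_boundary_equation(f, mu, period, guess, tol=1e-12, max_iter=20):
    """Newton iteration on g(x0) = psi(period, x0, mu) - x0, the periodic
    instance of (6.4); the derivative of g is taken by a forward difference."""
    x0 = guess
    for _ in range(max_iter):
        g = flow(f, x0, mu, period) - x0
        if abs(g) < tol:
            break
        eps = 1e-6
        g_eps = flow(f, x0 + eps, mu, period) - (x0 + eps)
        x0 -= g * eps / (g_eps - g)
    return x0

def f_example(t, x, mu):
    # Hypothetical right-hand side; phi(t) = 0 is a non-degenerate
    # 2*pi-periodic solution when mu = 0.
    return -x + mu * math.cos(t)

mu = 0.1
x0 = solve_boundary_equation(f_example, mu, 2 * math.pi, guess=0.0)
print(x0)
```

For this linear example the periodic solution is known in closed form, x(t, μ) = μ(cos t + sin t)/2, so the computed initial value should be close to μ/2 and should tend to φ(0) = 0 as μ → 0.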


7. Lemmas on non-homogeneous linear systems. In order to give a 
satisfactory analysis of the perturbation of autosynartetic solutions which are 
not non-degenerate, we cite the following lemma which is a slight variation 
of Lemma 1 in [4]. 


Lemma 7.1. Consider the linear differential system


(7.1) dξ/dt = A(t)ξ + f(t),


where ξ and f are n-vectors and A is an n × n matrix. A and f are known
continuous functions of t, defined on the closed interval <0, T> between 0 and
T (T may be either positive or negative, but not zero). Let X(t) be any
n × n matrix, such that dX/dt = A(t)X and det X(0) ≠ 0 (and hence also
det X(t) ≠ 0 for any t on <0, T>). Let n − k denote the rank of the n × n
matrix BX(0) − X(T), where B is a constant n × n matrix. Thus k is
the number of linearly independent solutions of the homogeneous system
corresponding to (7.1) satisfying the “autosynartetic” boundary condition,
Bξ(0) = ξ(T).

Then there exist (independently of f) a k × n matrix function Ξ(s)
and an n × n matrix G(t, s), both continuous, except that G(t, s) possesses
a finite jump at t = s, having the following properties:




(I) The system (7.1) possesses a solution satisfying the boundary
condition Bξ(0) = ξ(T), if and only if


(7.2) ∫_0^T Ξ(s)f(s) ds = 0.

(II) If (7.2) is satisfied, the vector function,

(7.3) ξ(t) = ∫_0^T G(t, s)f(s) ds,


is a solution of (7.1) and is, moreover, the only solution satisfying the
boundary condition Bξ(0) = ξ(T) which is orthogonal to every solution of the
corresponding homogeneous system taken with the same boundary condition.


(III) Whether (7.2) is satisfied or not, ξ(t), defined by (7.3), satisfies
the boundary condition Bξ(0) = ξ(T).


(IV) The rows of Ξ(s) are orthogonal to the rows of G(t, s). That is,

(7.4) ∫_0^T G(t, s)Ξ(s)’ ds = 0.

(V) The k × k matrix,


(7.5) ∫_0^T Ξ(s)Ξ(s)’ ds,


is nonsingular.


This Lemma may be regarded as well known. See for example [6],
where however, the Lemma does not appear in exactly the desired form.
In Lewis [4], pp. 537-540, will be found a complete proof* in the special
case B = I. Only trivial modifications are needed in this proof to establish
the lemma in its present form. Although we shall thus omit the proof of
Lemma 7.1, we must record for future use certain supplementary facts.
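
The integer k of Lemma 7.1 is simply n minus the rank of BX(0) − X(T), and it depends on B as well as on T. The sketch below computes it for one concrete choice; everything specific in it is an assumption made for illustration: the matrix A = [[0, 1], [-1, 0]] (so that X(t) is a rotation and X(0) = I), the three pairs (B, T), and the helper names matrix_rank and degeneracy_index.

```python
import math

def matrix_rank(rows, tol=1e-9):
    """Rank of a small dense matrix by Gaussian elimination with partial pivoting."""
    m = [list(r) for r in rows]
    n_rows, n_cols = len(m), len(m[0])
    rank = 0
    for col in range(n_cols):
        pivot = max(range(rank, n_rows), key=lambda r: abs(m[r][col]), default=None)
        if pivot is None or abs(m[pivot][col]) < tol:
            continue  # no usable pivot in this column
        m[rank], m[pivot] = m[pivot], m[rank]
        for r in range(rank + 1, n_rows):
            factor = m[r][col] / m[rank][col]
            for c in range(col, n_cols):
                m[r][c] -= factor * m[rank][c]
        rank += 1
    return rank

def rotation(t):
    """X(t) for d(xi)/dt = A xi with A = [[0, 1], [-1, 0]] and X(0) = I."""
    return [[math.cos(t), math.sin(t)], [-math.sin(t), math.cos(t)]]

def degeneracy_index(B, T):
    """k = n - rank(B X(0) - X(T)) for the rotation example, where n = 2."""
    X_T = rotation(T)
    diff = [[B[i][j] - X_T[i][j] for j in range(2)] for i in range(2)]
    return 2 - matrix_rank(diff)

identity = [[1.0, 0.0], [0.0, 1.0]]
minus_identity = [[-1.0, 0.0], [0.0, -1.0]]

k_periodic = degeneracy_index(identity, 2 * math.pi)   # B = I, T = 2*pi
k_half = degeneracy_index(identity, math.pi)           # B = I, T = pi
k_anti = degeneracy_index(minus_identity, math.pi)     # B = -I, T = pi
print(k_periodic, k_half, k_anti)
```

With B = I every solution is 2π-periodic, so k = 2; over the half period T = π no non-trivial solution satisfies ξ(0) = ξ(T), so k = 0; with B = −I and T = π every solution satisfies −ξ(0) = ξ(π), so k = 2 again. The rank tolerance is needed because sin(2π) and sin(π) are not exactly zero in floating point.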

Since (n − k) is the rank of BX(0) − X(T), there is a k × n matrix
𝔄 and an n × k matrix 𝔅, both of rank k, such that


(7.6) 𝔄[BX(0) − X(T)] = 0, [BX(0) − X(T)]𝔅 = 0.


In terms of 𝔄, the k × n matrix function Ξ(s) of the Lemma may be
taken as

* There is one formula in this proof which is in error. Namely the last formula on 
p. 539 should be R(t) = f"@(t,s)=Z(s)’C~ds. This error fortunately does not affect
the validity of the rest of the proof.


(7.7) Ξ(s) = 𝔄X(T)X(s)^{-1}.


Cf. Lewis [4], p. 537, formula (2.9). The k rows of Ξ(t) are also seen to
afford a full set of linearly independent autosynartetic solutions of the adjoint
system (4.15). This is established with the help of (7.6), Theorems 4.5,
4.6, and the well known fact that the rows of X(t)^{-1} satisfy the adjoint
system.

Finally 


(7.8) det [∫_0^T 𝔅’X(t)’X(t)𝔅 dt] ≠ 0.


Cf. Lewis [4], p. 538, formula (2.15) and the accompanying discussion, in 
which it is shown that the matrix in question is positive definite. 


Lemma 7.2. Under the same hypotheses as in the previous lemma and
using the same notation, the following (n + k) × (n + k) matrix has a
non-zero determinant:


       | BX(0) − X(T)             ∫_0^T X(T)X(s)^{-1}Ξ(s)’ ds |
  𝔐 =  |                                                       |
       | ∫_0^T 𝔅’X(t)’X(t) dt     R                            |


where the block in the upper left hand corner (viz. BX(0) − X(T)) is an
n × n matrix, while the block in the lower right hand corner (viz. R) is an
arbitrary k × k matrix.


Proof. Suppose det 𝔐 = 0. Then there would exist a linear relation-
ship between the columns of 𝔐 with coefficients which would not all be zero.
In other words, there would exist an n-vector β and a k-vector α, whose
components are not all zero such that

(7.9) [BX(0) − X(T)]β + [∫_0^T X(T)X(s)^{-1}Ξ(s)’ ds]α = 0,

(7.10) [∫_0^T 𝔅’X(t)’X(t) dt]β + Rα = 0.

Consider now the n-vector function

(7.11) ξ(t) = X(t)β − ∫_0^t X(t)X(s)^{-1}Ξ(s)’α ds.


This ξ(t) clearly satisfies the equation (7.1) in the special case that
f(t) = −Ξ(t)’α. Moreover (7.9) expresses the fact that ξ(t) satisfies the




boundary condition, Bξ(0) = ξ(T). Hence Lemma 7.1 assures us, by
means of (7.2), that


∫_0^T Ξ(t)Ξ(t)’α dt = 0.


Referring now to (V) of Lemma 7.1, we conclude that α = 0. Thus
ξ(t) = X(t)β, whereas (7.10) expresses the fact that ξ(t), which we already
know satisfies the boundary condition, is also orthogonal to every solution,
satisfying the same boundary condition, of the homogeneous equation
dξ(t)/dt = A(t)ξ(t). In particular ξ(t) must be orthogonal to itself.
Hence it must vanish identically, i.e. X(t)β = 0, and since X(t) is non-
singular, we conclude that β = 0. With both α and β necessarily reducing
to zero, we have reached a contradiction of the above italicized statement
resulting from the assumption that det 𝔐 = 0. This is all that is needed
to establish the lemma.
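
As a sanity check on Lemma 7.2, the simplest degenerate case can be worked by hand. The example below is an illustration under assumptions made here, not taken from the paper: n = k = 1, A(t) = 0, B = 1, T > 0, and the normalization 𝔄 = 𝔅 = 1. The four blocks are the ones appearing in the linear relations (7.9) and (7.10).

```latex
\[
X(t)\equiv 1,\qquad BX(0)-X(T)=0,\qquad
\Xi(s)=\mathfrak{A}\,X(T)X(s)^{-1}=1,
\]
\[
\mathfrak{M}
=\begin{pmatrix}
BX(0)-X(T) & \int_0^T X(T)X(s)^{-1}\,\Xi(s)'\,ds\\[4pt]
\int_0^T \mathfrak{B}'X(t)'X(t)\,dt & R
\end{pmatrix}
=\begin{pmatrix} 0 & T\\ T & R \end{pmatrix},
\qquad
\det\mathfrak{M}=-T^{2}\neq 0 .
\]
```

The determinant does not involve R at all, which is consistent with the statement that the lower right block may be an arbitrary k × k matrix.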


8. The bifurcation equations in the degenerate cases. Suppose that
φ(t) is an autosynartetic solution of (6.1) under the transformation (6.2)
when μ = 0. We consider the variational equation (4.1) with A(t)
= f_x(t, φ(t), 0). We suppose that φ(t) has degeneracy k, which means that
(4.1) has just k linearly independent solutions satisfying the boundary
conditions (4.18). In referring in this way to the material of Section 4, we
mean that the f(t, x), h(t, x) and P(t, x) of Section 4 are to be identified
with the f(t, x, 0), h(t, x, 0), and P(t, x, 0) respectively of this Section or
Section 6.

We define the n × n matrix X(t) by the two conditions


(8.1) dX(t)/dt = A(t)X(t), X(0) = I,
and then introduce the matrices 𝔄, 𝔅, Ξ(t) as in Section 7.


THEOREM 8.1. It is possible to define a unique n-vector function x(t, c, μ)
and a k-vector function α(c, μ) of class C’ for sufficiently small |c| and |μ|,
where c is a k-vector, in such a manner that the following four conditions
are fulfilled:


(8.2) ∂x(t, c, μ)/∂t = f[t, x(t, c, μ), μ] − Ξ(t)’α(c, μ)

(8.3) x[P(0, x(0, c, μ), μ), c, μ] = h(0, x(0, c, μ), μ)

(8.4) ∫_0^T 𝔅’X(t)’[x(t, c, μ) − φ(t) − X(t)𝔅c] dt = 0

(8.5) x(t, 0, 0) = φ(t).


Proof. We introduce the n-vector function ψ(t, x_0, α, μ) in such a manner
that
(8.6) x = ψ(t, x_0, α, μ)


is the solution of the differential system,

(8.7) dx/dt = f(t, x, μ) − Ξ(t)’α

which reduces to x_0, when t = 0. That is,

(8.8) ψ(0, x_0, α, μ) = x_0.

Other more or less obvious properties of ψ are the following:
(8.9) ψ(t, φ(0), 0, 0) = φ(t).

(8.10) ψ_{x_0}(t, φ(0), 0, 0) = X(t).


(8.11) ψ_α(t, φ(0), 0, 0) = σ(t) = −∫_0^t X(t)X(s)^{-1}Ξ(s)’ ds,


where σ is an n × k matrix which vanishes when t = 0 and satisfies the
system dσ/dt = A(t)σ − Ξ(t)’. To give some indication of how these
properties are derived, we remark that (8.9) follows from the uniqueness
theorem for (8.7) and the definition of φ(t). We derive (8.10) when we
substitute ψ(t, x_0, α, μ) for x in (8.7), differentiate with respect to x_0, set
x_0 = φ(0), α = 0, μ = 0, and use the definition of A(t) = f_x(t, φ(t), 0)
and (8.1). We get (8.11) in a similar manner by differentiating with respect
to α. The expression for σ(t), of course, comes from the Lagrange result for
“variation of parameters.”
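
The variation-of-parameters expression for σ(t) can be checked numerically in a scalar setting. The sketch below is an illustration under assumptions made here: n = k = 1, a constant coefficient A(t) = a, the fundamental solution X(t) = exp(at), and Ξ(s) = X(T)X(s)^{-1}, which is the form (7.7) with the scalar normalization 𝔄 = 1. It integrates dσ/dt = aσ − Ξ(t) with σ(0) = 0 by a Runge-Kutta scheme and compares the result with a Simpson quadrature of −∫_0^t X(t)X(s)^{-1}Ξ(s) ds.

```python
import math

a, T = 0.5, 1.0  # hypothetical constant coefficient and interval length

def xi_row(s):
    """Scalar stand-in for Xi(s) = X(T) X(s)^{-1} with X(t) = exp(a t)."""
    return math.exp(a * (T - s))

def sigma_by_ode(t_end, steps=1000):
    """RK4 for d(sigma)/dt = a*sigma - Xi(t), sigma(0) = 0."""
    def rhs(t, s):
        return a * s - xi_row(t)
    h = t_end / steps
    t, sigma = 0.0, 0.0
    for _ in range(steps):
        k1 = rhs(t, sigma)
        k2 = rhs(t + h / 2, sigma + h * k1 / 2)
        k3 = rhs(t + h / 2, sigma + h * k2 / 2)
        k4 = rhs(t + h, sigma + h * k3)
        sigma += h * (k1 + 2 * k2 + 2 * k3 + k4) / 6
        t += h
    return sigma

def sigma_by_formula(t_end, steps=1000):
    """Composite Simpson rule for -int_0^t X(t) X(s)^{-1} Xi(s) ds (steps even)."""
    def integrand(s):
        return math.exp(a * (t_end - s)) * xi_row(s)
    h = t_end / steps
    total = integrand(0.0) + integrand(t_end)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * integrand(i * h)
    return -total * h / 3

print(sigma_by_ode(1.0), sigma_by_formula(1.0))
```

With a = 0.5 and T = 1 the integrand reduces to exp(1 − s), so both numbers should agree with −(e − 1) up to discretization error.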

We next try to find x_0 and α as functions of c and μ in such a manner
that, when the functions are inserted in (8.6), ψ(t, x_0(c, μ), α(c, μ), μ) reduces
to the required x(t, c, μ) satisfying conditions (8.2), (8.3), (8.4), and (8.5).
No matter how x_0(c, μ) and α(c, μ) are chosen (8.2) is automatically satisfied
because ψ was defined to be a solution of (8.7); and, if furthermore we require
that


(8.12) x_0(0, 0) = φ(0),
(8.13) α(0, 0) = 0,


we see that (8.5) is also satisfied because of (8.9). The other two
conditions (8.3) and (8.4) lead to the following vector equations for the
determination of x_0 and α as functions of c and μ.


(8.14) −ψ[P(0, ψ(0, x_0, α, μ), μ), x_0, α, μ] + h(0, ψ(0, x_0, α, μ), μ) = 0.


(8.15) ∫_0^T 𝔅’X(t)’[ψ(t, x_0, α, μ) − φ(t) − X(t)𝔅c] dt = 0.


We note first of all that, consistently with (8.12) and (8.13), the last two
equations (viz. (8.14) and (8.15)) are satisfied when x_0 = φ(0), α = 0,
c = 0, and μ = 0. This is because of (8.9) and the fact that φ[P(0, φ(0), 0)]
= h[0, φ(0), 0], since φ is given as an autosynartetic solution of dx/dt = f(t, x, 0)
under the transformation s = P(t, x, 0), y = h(t, x, 0).

Hence, if we can show that the jacobian of the system furnished by (8.14)
and (8.15) with respect to the components of x_0 and α does not vanish when
x_0 = φ(0), α = 0, c = 0, and μ = 0, these equations, (8.14) and (8.15), by
the implicit function theorem, effectively give the required functions x_0(c, μ)
and α(c, μ) for sufficiently small values of c and μ. Hence the proof of
Theorem 8.1 will be complete as soon as we show that this jacobian is not 0.

Differentiating the left member of (8.14) with respect to x_0 and then
setting x_0 = φ(0), α = 0, μ = 0, gives with the help of (8.9) and (8.10)
the following:


h_x[0, φ(0), 0]X(0) − ψ_t[P(0, φ(0), 0), φ(0), 0, 0]·P_x(0, φ(0), 0) − X(P(0, φ(0), 0)),


which, from the facts that X(0) = I, that P(0, φ(0), 0) = T, and that ψ
satisfies (8.7) and (8.9), can be written in the form,


[h_x[0, φ(0), 0]
        − f[P(0, φ(0), 0), φ(P(0, φ(0), 0)), 0]·P_x(0, φ(0), 0)]X(0) − X(T).

Hence, from (4.9 alt.) and (4.19), we find that the jacobian matrix of
(8.14) with respect to the components of x_0 at the point in question is the
n × n matrix BX(0) − X(T), which is just the block of elements in the
upper left hand corner of the matrix 𝔐 of Lemma 7.2.

Differentiating the left member of (8.14) with respect to α, using (8.11)
and evaluating at x_0 = φ(0), α = 0, μ = 0, yields


∫_0^T X(T)X(s)^{-1}Ξ(s)’ ds,


which is the block of elements in the upper right hand corner of 𝔐.




Differentiating the left member of (8.15) with respect to x_0 yields in a
similar manner the k × n matrix,


∫_0^T 𝔅’X(t)’X(t) dt,


which is the block of elements in the lower left corner of 𝔐.
Differentiating the left member of (8.15) with respect to α yields a
certain k × k matrix R, whose properties need no further investigation and
which furnishes the block of elements in the lower right corner of 𝔐.
In other words, the jacobian we are interested in turns out to be det 𝔐,
which we know from Lemma 7.2 is not zero. This establishes Theorem 8.1,
except for a few details which we leave to the reader.


THEOREM 8.2. If the k-vector c and the scalar μ satisfy the k-vector
equation,
(8.16) α(c, μ) = 0,


then the n-vector function x(t, c, μ) is an autosynartetic solution of (6.1)
under the transformation (6.2). Here it is understood that α(c, μ) and
x(t, c, μ) are as introduced in Theorem 8.1.


Proof. Because of (8.2) and (8.16), x(t, c, μ) is a solution of (6.1).
Because of (8.3) and Theorem 3.4 (with t_0 = 0, etc.), x(t, c, μ) is also
autosynartetic.

THEOREM 8.3. If x̄(t) is an autosynartetic solution of (6.1) under
(6.2) and if |μ| and |x̄(t) − φ(t)| are sufficiently small, there exists a
k-vector c such that (8.16) is satisfied and such that x̄(t) = x(t, c, μ).

Proof. Using the function ψ(t, x_0, α, μ) introduced at the beginning of
the proof of Theorem 8.1 (cf. (8.6)), we see from the uniqueness theorem
for differential equations that


(8.17) x̄(t) = ψ(t, x̄(0), 0, μ).

Since x̄(t) is given as autosynartetic under (6.2), Theorem 3.4 shows that
x̄[P(0, x̄(0), μ)] = h(0, x̄(0), μ) or, using (8.17), we get

(8.18) ψ[P(0, x̄(0), μ), x̄(0), 0, μ] = h(0, x̄(0), μ).


Since x̄(0) = ψ(0, x̄(0), 0, μ), we see from (8.18) that (8.14) is satisfied by
x_0 = x̄(0) and α = 0.
We define the k-vector c by means of the system of linear equations




(8.19) ∫_0^T 𝔅’X(t)’[ψ(t, x̄(0), 0, μ) − φ(t) − X(t)𝔅c] dt = 0.


The determinant of this system in the components of c is seen by (7.8) to be
different from zero. Hence (8.19) gives a valid definition of c; and moreover
|c| will be small if |μ| and |x̄(0) − φ(0)| are sufficiently small. With
this value of c we see, by comparing (8.19) with (8.15), that the latter
equation is satisfied by x_0 = x̄(0) and α = 0.

But the complete implicit function theorem contains a statement to the
effect that there are no solutions near the given solution except those furnished
by the implicitly defined functions. Applying this statement to the system
(8.14)-(8.15), and using the notation x_0(c, μ), as well as α(c, μ),
introduced in the proof of Theorem 8.1, we find that x̄(0) = x_0(c, μ) and
0 = α(c, μ). Hence by (8.17) we have


x(t, c, μ) = ψ[t, x_0(c, μ), α(c, μ), μ] = ψ[t, x̄(0), 0, μ] = x̄(t),
as we wished to prove.


DEFINITION 8.1. The k-vector function α(c, μ) being introduced as in
Theorem 8.1, the k-vector equation (8.16) (or the system of k scalar
equations (8.16)) will be called the bifurcation equation (or equations) for the
autosynartetic solution φ(t) of degeneracy k.


Theorems 8.2 and 8.3 may be summarized by the statement that the 
problem of the perturbation of an autosynartetic solution of degeneracy k 
can always be reduced to the problem of solving a system of k “bifurcation” 
equations instead of the generally larger system of n equations like (6.4). 
An example of the advantage of using bifurcation equations is indicated in 
the following 


THEOREM 8.4. Suppose that the system (6.1) admits l independent
autosynartetic first integrals of class C’ in t, x, and μ. Suppose also that for
μ = 0, φ(t) is a given autosynartetic solution of (6.1) of degeneracy k ≥ l;
and suppose that the k-vector function α(c, μ) is set up as in Theorem 8.1.
Then there exists an l × k matrix function Q(c, μ) of rank l (when |c| and
|μ| are sufficiently small), such that


(8.20) Q(c, μ)α(c, μ) = 0.


Proof. Let the l autosynartetic first integrals be represented as the
components of the l-vector Λ(t, x, μ). From Theorem 5.2, we know that the
rows of the l × n matrix Λ_x(t, x, μ) evaluated for x = φ(t) and μ = 0 are
autosynartetic solutions of the adjoint to the variational equations. Since
also the k × n matrix Ξ(t) of (7.7) is such that its k rows afford a full set
of linearly independent autosynartetic solutions of the adjoint system it is
clear that the rows of Λ_x(t, φ(t), 0) are linear combinations of the rows of
Ξ(t). In other words there exists a constant l × k matrix D such that


(8.21) Λ_x(t, φ(t), 0) = DΞ(t).


Moreover, since the l first integrals are independent, Λ_x(t, φ(t), 0), and hence
D, must be of rank l. If now we define the l × n matrix


(8.22) Δ(t, c, μ) = Λ_x(t, x(t, c, μ), μ) − DΞ(t),


where x(t, c, μ) has the meaning explained in Theorem 8.1, we see from the
continuity of x(t, c, μ) and of Λ_x(t, x, μ) and from (8.21) that Δ(t, c, μ) → 0
uniformly on any finite t-interval as |c| and |μ| → 0.


Since Λ(t, x, μ) is a vector first integral, we have


Λ_t(t, x, μ) + Λ_x(t, x, μ)f(t, x, μ) = 0.


In this identity, we replace x by the vector x(t, c, μ) of Theorem 8.1 and
integrate from 0 to


(8.23) T(c, μ) = P(0, x(0, c, μ), μ).
We thus obtain

(8.24) ∫_0^{T(c,μ)} Λ_x[t, x(t, c, μ), μ] f[t, x(t, c, μ), μ] dt
        + ∫_0^{T(c,μ)} Λ_t[t, x(t, c, μ), μ] dt = 0.


Since Λ is autosynartetic, we have (by Definition 5.1)


(8.25) Λ(t, x, μ) = Λ(P(t, x, μ), h(t, x, μ), μ).


Setting t = 0, x = x(0, c, μ), and remembering that h(0, x(0, c, μ), μ)
= x(T(c, μ), c, μ) by Theorem 8.1 and (8.23), we find from (8.25) that
Λ[T(c, μ), x(T(c, μ), c, μ), μ] − Λ(0, x(0, c, μ), μ) = 0. Hence


∫_0^{T(c,μ)} (d/dt)Λ[t, x(t, c, μ), μ] dt = 0.


Hence

(8.26) ∫_0^{T(c,μ)} Λ_x[t, x(t, c, μ), μ] x_t(t, c, μ) dt
        + ∫_0^{T(c,μ)} Λ_t[t, x(t, c, μ), μ] dt = 0.


Subtracting (8.26) from (8.24) yields


∫_0^{T(c,μ)} Λ_x[t, x(t, c, μ), μ]{f[t, x(t, c, μ), μ] − x_t(t, c, μ)} dt = 0.


But, from (8.2), this last identity may be written


∫_0^{T(c,μ)} Λ_x[t, x(t, c, μ), μ] Ξ(t)’α(c, μ) dt = 0.


We therefore obtain (8.20), if we let


Q(c, μ) = ∫_0^{T(c,μ)} Λ_x(t, x(t, c, μ), μ) Ξ(t)’ dt.


Evidently from (8.22), with Δ(t, c, μ) denoting the matrix defined there,


Q(c, μ) = D ∫_0^{T(c,μ)} Ξ(t)Ξ(t)’ dt + ∫_0^{T(c,μ)} Δ(t, c, μ)Ξ(t)’ dt.


Now T(c, μ) → T(0, 0) = T as c → 0 and μ → 0, and Δ(t, c, μ) → 0 uniformly.
We also know that ∫_0^T Ξ(t)Ξ(t)’ dt is a non-singular k × k matrix (cf. Lemma
7.1 (V)). It therefore follows that Q(c, μ) must, like D, have the rank l
for |c| and |μ| sufficiently small.


A simple corollary of Theorem 8.4 is the following 


THEOREM 8.5. If there are l independent autosynartetic first integrals,
the perturbation of any autosynartetic solution of degeneracy k (necessarily
≥ l by Theorem 5.3) may be effected by the solution of k − l of the
bifurcation equations, the other l bifurcation equations being then automatically
satisfied. In particular, if k = l, the bifurcation equations are all identically
satisfied.


Proof. Since Q(c, μ) is of rank l, we may solve (8.20) for l of the
components of α, say α_1, ..., α_l, in terms of the other components α_{l+1}, ..., α_k,
so that we have equations of the form


α_i(c, μ) = Σ_{j=l+1}^{k} Q_{ij}(c, μ)α_j(c, μ),   i = 1, ..., l.


Hence, if α_j(c, μ) = 0 for j = l + 1, ..., k, we also have α_i(c, μ) = 0 for
i = 1, ..., l.


THEOREM 8.6. Suppose that the system dx/dt = f(t, x, 0) admits l
independent autosynartetic first integrals of class C’ in t and x (even though
the system dx/dt = f(t, x, μ) for μ ≠ 0 may not). Suppose also that φ(t)
is a given autosynartetic solution of (6.1) under (6.2) when μ = 0 and that
the k-vector function α(c, μ) is set up as in Theorem 8.1. Then there exists
an l × k matrix function Q(c) of rank l, when |c| is sufficiently small, such
that
(8.27) Q(c)α(c, 0) = 0.


Proof. Consider the modified system dx/dt = f*(t, x, μ) where f*(t, x, μ)
= f(t, x, 0) is a constant with respect to variation in μ. Then the given l
autosynartetic first integrals of dx/dt = f(t, x, 0), which are also independent of μ,
are autosynartetic first integrals of dx/dt = f*(t, x, μ) under the modified
transformation s = P*(t, x, μ) = P(t, x, 0), y = h*(t, x, μ) = h(t, x, 0). Hence
Theorem 8.4 applies to the modified system under the modified transformation
and we get an l × k matrix, Q*(c, μ), of rank l for |μ| and |c| sufficiently
small, such that
(8.28) Q*(c, μ)α*(c, μ) = 0,


where α*(c, μ) is to the modified system as α(c, μ) is to the original system.
The fact that α*(c, μ) = α(c, 0) follows from the following two facts: (I)
The equations for the determination of α*(c, μ) and x_0*(c, μ), namely
equations (8.14) and (8.15) set up for the modified system, are seen to be
completely independent of μ, so that α*(c, μ) = α*(c, 0). (II) These same
equations are identical with the original equations (8.14) and (8.15) when
μ = 0, so that α*(c, 0) = α(c, 0). Thus, from (8.28), we find that
Q*(c, μ)α(c, 0) = 0, from which we get (8.27) by choosing Q(c) = Q*(c, 0).

We next turn to the consideration of properties of the bifurcation equa- 
tions not dependent on the existence of autosynartetic first integrals. The 
general situation is exhaustively treated in the next two theorems. 




THEOREM 8.7. The left hand member of the bifurcation equation (8.16)
always possesses the following two properties:


(8.29) α(0, 0) = 0
(8.30) ∂α(0, 0)/∂c = 0.
Proof. From (8.5) and (8.2) we have

dφ(t)/dt = f[t, φ(t), 0] − Ξ(t)’α(0, 0).


But φ(t) was given at the outset as a solution of (6.1) when μ = 0. Hence


dφ(t)/dt = f[t, φ(t), 0].


Therefore Ξ(t)’α(0, 0) = 0. From (7.7) we know that the rank of Ξ is k.
Hence α(0, 0) = 0.

Incidentally, this simple proof of (8.29) or (8.13) as a necessary
consequence of (8.2) and (8.5) is one of the details left to the reader at
the end of the proof of Theorem 8.1. It is a step in the proof of the
uniqueness part of Theorem 8.1.
We next establish (8.30) as follows: With the help of (8.8) we write
equation (8.14) in the form,


ψ[P(0, x_0, μ), x_0, α, μ] − h(0, x_0, μ) = 0.


This equation is satisfied by x_0 = x_0(c, μ) and α = α(c, μ). Hence, upon
setting μ = 0, we obtain


ψ[P(0, x_0(c, 0), 0), x_0(c, 0), α(c, 0), 0] − h(0, x_0(c, 0), 0) = 0.


Differentiating with respect to c, freely using the various properties of ψ
indicated in formulas (8.6) to (8.11), and also remembering the definition
of B(0) given by (4.9 alt.), we find, on setting c = 0 and x_0(0, 0) = φ(0),
that
−B(0)(∂x_0/∂c) + X[P(0, φ(0), 0)](∂x_0/∂c)
        + σ[P(0, φ(0), 0)](∂α(0, 0)/∂c) = 0.


Since B(0) = B, X(0) = I, and P(0, φ(0), 0) = T, we therefore have


[X(T) − BX(0)](∂x_0/∂c) + σ(T)(∂α(0, 0)/∂c) = 0.




Hence, from (7.6), 𝔄σ(T)(∂α(0, 0)/∂c) = 0, while from (8.11), (7.7), and
(V) of Lemma 7.1, we see that


𝔄σ(T) = −∫_0^T Ξ(s)Ξ(s)’ ds


is a non-singular matrix, so that (8.30) follows immediately.


THEOREM 8.8. The bifurcation equations in general have no distinctive
properties other than those specified in Theorem 8.7. More precisely:

Let β(c, μ) be an arbitrary k-vector function of the k-vector c and the
scalar μ. Let β(c, μ) be of class C’, and let it vanish together with its first
partial derivatives with respect to the components of c when c = 0 and μ = 0.
Let n be any integer ≥ k and suppose T is any positive number. Then there
exists a system (6.1) of order n such that f(t + T, x, μ) = f(t, x, μ), which
possesses, for μ = 0, a periodic solution of period T and degeneracy k, and
whose bifurcation equations are β(c, μ) = 0.


Proof. We proceed to set up such a system. For this purpose, the first
k components of the n-vector x will be thought of as a k-vector x’, while the
last (n − k) components of x will be called the (n − k)-vector x”. Let A”(t)
be any (n − k) × (n − k) matrix of continuous periodic functions of t of
period T, such that the linear system dx”/dt = A”(t)x” has no non-trivial
periodic solution. A”(t) could be a constant matrix, if desired. Then it is
claimed that the system

(8.31) dx’/dt = β(x’, μ), dx”/dt = A”(t)x”,


admits, for μ = 0, the periodic solution x’ = 0, x” = 0, and that the
bifurcation equations (8.16) for the perturbation of this solution are such that


(8.32) α(c, μ) = β(c, μ),


where, of course, it is understood that α(c, μ) is to be constructed so as to
satisfy the conditions of Theorem 8.1. Since β_{x’}(0, 0) = 0, the variational
equations take the form


dξ’/dt = 0,  dξ”/dt = A”(t)ξ”.


The n × n matrix solution X(t) of this system such that X(0) = I is seen
to have the form

         | I’   0     |
  X(t) = |            |,
         | 0    X”(t) |



where I’ is the k × k identity matrix while X”(t) is an (n − k) × (n − k)
matrix, no linear combination of whose columns is periodic, while X”(0) = I”,
the (n − k) × (n − k) identity matrix. In formula (7.6) the matrices B,
𝔄, and 𝔅 should now evidently be interpreted as follows:


B = I, 𝔄 = (I’, 0), 𝔅 = (I’, 0)’.


From (7.7) we then find that Ξ(s) = (I’, 0) so that Ξ(t)’α = (I’, 0)’α = (α, 0)’.


Hence (8.2) becomes


(I) ∂x’/∂t = β(x’, μ) − α, ∂x”/∂t = A”(t)x”.


The condition (8.3) reduces, of course, to


(II) x’(T, c, μ) = x’(0, c, μ), x”(T, c, μ) = x”(0, c, μ).


From the fact that φ(t) = 0 in the present example, a routine calculation
shows that (8.4) reduces to


(III) ∫_0^T [x’(t, c, μ) − c] dt = 0,


while (8.5) is simply


(IV) x(t, 0, 0) = 0.


According to Theorem 8.1, these conditions determine x(t, c, μ) and α(c, μ)
uniquely. But we see by inspection that all conditions are satisfied, if we
take x’(t, c, μ) = c, x”(t, c, μ) = 0, and α = α(c, μ) = β(c, μ). This
completes the proof of (8.32) and, therefore, of the theorem.
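
The construction of Theorem 8.8 can be tried out in the smallest case n = k = 1. The sketch below is an illustration under assumptions made here: the prescribed function β(c, μ) = μ − c² (which vanishes together with its c-derivative at the origin), the period T = 1, and the helper names. It checks that x(t, c, μ) ≡ c solves the modified equation dx/dt = β(x, μ) − α with α = β(c, μ), which is the scalar form of (8.2) together with (8.32), and then solves the bifurcation equation α(c, μ) = 0 for c by bisection.

```python
def beta(c, mu):
    """Hypothetical prescribed function: beta(0, 0) = 0 and d(beta)/dc = 0 at (0, 0)."""
    return mu - c * c

def modified_flow(c, mu, T=1.0, steps=200):
    """RK4 for dx/dt = beta(x, mu) - alpha with alpha = beta(c, mu) and x(0) = c.
    Returns x(T); a value equal to c confirms the periodicity condition (II)."""
    alpha = beta(c, mu)
    def rhs(x):
        return beta(x, mu) - alpha
    h = T / steps
    x = c
    for _ in range(steps):
        k1 = rhs(x)
        k2 = rhs(x + h * k1 / 2)
        k3 = rhs(x + h * k2 / 2)
        k4 = rhs(x + h * k3)
        x += h * (k1 + 2 * k2 + 2 * k3 + k4) / 6
    return x

def bisect_root(func, lo, hi, iterations=60):
    """Plain bisection; assumes func(lo) and func(hi) have opposite signs."""
    f_lo = func(lo)
    for _ in range(iterations):
        mid = (lo + hi) / 2
        f_mid = func(mid)
        if (f_mid > 0) == (f_lo > 0):
            lo, f_lo = mid, f_mid
        else:
            hi = mid
    return (lo + hi) / 2

mu = 0.04
x_end = modified_flow(0.3, mu)                           # stays at c = 0.3 for any c
c_star = bisect_root(lambda c: beta(c, mu), 0.0, 1.0)    # root of alpha(c, mu) = 0
print(x_end, c_star)
```

For μ = 0.04 the positive root is c = 0.2, and the constant x ≡ 0.2 is then a genuine periodic solution of dx/dt = β(x, μ). For μ < 0 this particular β has no real root, so the bifurcation equation, and with it the perturbed system, has no periodic solution near the base solution.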

We close this section with a brief consideration of the case when (6.1)
is transformed into itself by a family of transformations,


(8.33) s = P(t, x, μ, λ), y = h(t, x, μ, λ),


where the parameter λ is a scalar, or, more generally an m-vector. The base
solution φ(t) for μ = 0 is supposed to be autosynartetic under (8.33) for
μ = 0 and λ = 0. We proceed exactly as in the earlier case when the
transformation did not depend on λ; only now our bifurcation equation (as well
as the function x(t, c, μ) of Theorem 8.1) will depend on λ. We therefore
write our bifurcation equation in the form,


(8.34) α(c, μ, λ) = 0.


Evidently the presence of the new variable λ increases the probability of our
being able to solve the bifurcation equations. Thus (8.34) may have no
solution for which μ ≠ 0 and λ = 0, but it might very well have some for
which μ ≠ 0 and λ ≠ 0. In the latter case, we are led to an autosynartetic
solution of (6.1) under (8.33) for values of λ ≠ 0.

The most familiar case in which this situation arises is in the perturba- 
tion of a non-constant periodic solution of an autonomous system. Some 
remarks about how this case fits into our general theory have already appeared 
in Sections 1 and 5. 


9. Miscellaneous comments. The reader may notice that the author
has abandoned the method of integral equations used in his previous papers,
especially [3]. The reason for this is primarily that the interval of
integration, which would be between 0 and P(0, x(0, μ), μ) is, in general,
dependent on the unknown solution x(t, μ) as well as upon the parameter μ.
A secondary reason is that the autosynartetic boundary conditions for the
variational equations are not the same (in general) for different autosynartetic
solutions. In the special case, when P(t, x, μ) is independent of both x and μ
and h(t, x) is linear and homogeneous in x, as is the case in the study of
the perturbation of nonautonomous periodic solutions, these objections are no
longer valid. In such cases the integral equation method is entirely feasible
and, indeed, would afford the best available method of estimating the
μ-interval over which perturbation is possible.

Likewise the reader will observe that only a part of Lemma 7.1 is 
essential for the purposes of the present paper. In particular no use is made 
of the “generalized Green’s matrix,” G(t,s). Nevertheless, for the sake of 
completeness and in view of the indispensability of this G(t,s) for the integral 
equation method in the fairly general case mentioned above when it is feasible 
and useful to employ this technique, it seemed desirable to take the oppor- 
tunity of presenting Lemma 7.1 in its complete form. 

Our theory of the bifurcation equation, when specialized to the periodic
case, is seen to parallel very closely a theory outlined by Friedrichs [1].
Friedrichs, however, would seem to replace our condition (8.4) by a
condition of the form,


(9.1) ωx(0, c, μ) = c,


where ω is a constant k × n-matrix, while c is a k-vector, as before. The
present author felt that (8.4) was a more “natural” condition, perhaps
because of his previous exposure to the work of Ernst Hölder [2]. It is
probably possible to include both conditions, namely (8.4) and (9.1), under
a more general condition, if we tamper somewhat with our Lemma 7.1,
using a more generalized notion of orthogonality, which would involve the
use of weight factors and Stieltjes integration. Such a generalization would
probably be a mere tour de force devoid of substantially new results.


THE JOHNS HOPKINS UNIVERSITY AND RIAS. 


REFERENCES. 


[1] K. O. Friedrichs, “Fundamentals of Poincaré’s theory,” Proceedings of the
Symposium on Non-linear Circuit Analysis, Polytechnic Institute of Brooklyn,
1953, pp. 56-67.

[2] Ernst Hölder, “Mathematische Untersuchungen zur Himmelsmechanik,”
Mathematische Zeitschrift, vol. 31 (1930), pp. 197-257.

[3] E. Kamke, Differentialgleichungen, Lösungsmethoden und Lösungen, Akademische
Verlagsgesellschaft, Leipzig, 1956.

[4] D. C. Lewis, “On the role of first integrals in the perturbation of periodic solutions,”
Annals of Mathematics, vol. 63 (1956), pp. 535-548.

[5] Henri Poincaré, Les méthodes nouvelles de la mécanique céleste, Gauthier-Villars,
Paris, 1892.

[6] W. T. Reid, “Generalized Green’s matrices for compatible systems of differential
equations,” American Journal of Mathematics, vol. 53 (1931), pp. 443-459.




CALCULATION OF CLASS NUMBERS BY DECOMPOSITION INTO
THREE INTEGRAL SQUARES IN THE FIELDS OF $2^{1/2}$ AND $3^{1/2}$.*¹


By HARVEY COHN.²


12. Introduction. The results presented here are a continuation of
earlier results of the author [16] on modular functions of two complex
variables relating to the fields of $2^{1/2}$ and $3^{1/2}$. Their significance is
that these results broaden the range of those modular functions which
illustrate elegant properties too refined to be concluded wholly from such
vast theoretical structures as that of Siegel [15]. The results have, in
addition, a useful relationship to class-number calculations, which we
proceed to summarize.


Consider the real quadratic field $R(D^{1/2})$ with discriminant $D > 0$, and,
within the field, consider a totally positive integer $\mu$ ($\gg 0$) which might be
specialized to be rational. (When $D = 1$, the field will be taken as rational.)
We assume $\mu$ to be square-free for simplicity. We let $H(D^{1/2}, (-\mu)^{1/2})$ denote
the class number of $R(D^{1/2}, (-\mu)^{1/2})$ and we let $A_3(\nu)$ denote the number
(possibly zero) of decompositions of type


(12.1) $\nu = \xi_1^2 + \xi_2^2 + \xi_3^2$


where $\xi_i$ are integers in $R(D^{1/2})$ and every permutation or change of sign in
the triple $(\xi_1, \xi_2, \xi_3)$ is tallied as an additional decomposition.


The class-number relationships which concern us have essentially the
form


(12.2) $A_3(\mu\gamma^2) = G\,H(D^{1/2}, (-\mu)^{1/2})$,
       $\mu$ square-free and totally positive ($\gg 0$),


where $D = 1, 5, 8$, and $12$. The factor $\gamma^2$ is needed in (12.2) because when
$D = 8$ or $12$, $\nu$ must belong to $\mathfrak{O}_2$, the ring of integers $\nu$ of $R(D^{1/2})$ of form
$a + bD^{1/2}$. Thus even if $\mu \notin \mathfrak{O}_2$, we can conclude $\mu\gamma^2 \in \mathfrak{O}_2$ when we set


* Received May 6, 1960.

¹ Work supported by Research Grant G-7412 of the National Science Foundation.
Presented to the American Mathematical Society, January 27, 1960.

² The numbering of sections and bibliographical items is consecutive with the earlier
paper [16]. With the exception of some general theorems of § 7 (et al.) this paper has
self-contained proofs, but perpetuates the earlier notation.


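(12.3) $\gamma = 1$ for $D = 1, 5$,
       $\gamma = 1$ for $D = 8, 12$ and $\mu \in \mathfrak{O}_2$,
       $\gamma = 2^{1/2}$ for $D = 8$ and $\mu \notin \mathfrak{O}_2$,
       $\gamma = 1 + 3^{1/2}$ for $D = 12$ and $\mu \notin \mathfrak{O}_2$.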

The values of $G$ are given in the accompanying table. These values depend
on the manner in which the ideal (2) factors in $R(D^{1/2}, (-\mu)^{1/2})$ into distinct
prime ideal factors p, q. A list of cases based on easily recognized residue
classes is given in the adjoining column:


D G Factors of (2) Residue classes of 


0 (mod 8) 


24 
12 misc. 


96 7, 3, (13 + 5^{1/2})/2, (9 + 3·5^{1/2})/2 (mod 8)
5 + 2·5^{1/2}, (5 + 5^{1/2})/2, (9 + 5·5^{1/2})/2
12 misc.


48 7, 5 + 2·2^{1/2} (mod 4·2^{1/2})
96 3, 1 + 2·2^{1/2}
24 misc.


24* 7, 5 (mod 4(1 + 3^{1/2}))
3, 1 "
12* misc.
12* misc. 


(* See further explanation below.) 


Here, for simplicity, we have further restricted $\mu$ to the case where the
fields $R(D^{1/2})$ and $R(D^{1/2}, (-\mu)^{1/2})$ have precisely the same units. The
exceptions are as follows (ignoring factors of $\mu$ which are squares for
$R(D^{1/2})$):


(12.4)
for D = 1, 5, 8, 12,
$\mu = (5 + 5^{1/2})/2$ for $D = 5$,
$\mu = 2 + 3^{1/2}$ for $D = 12$.


In the first three cases, a complex root of unity is introduced into
$R(D^{1/2}, (-\mu)^{1/2})$ and in the last case a new fundamental unit
$(-2 - 3^{1/2})^{1/2}$ is introduced that was not present in $R(D^{1/2})$. An
independent calculation reveals that in the cases (12.4) the class number
$H(D^{1/2}, (-\mu)^{1/2})$ is unity, when $D = 5, 8$.
(A formula of type (20.16) below reconciles these exceptions.)
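
For the rational case $D = 1$ (where $\gamma = 1$) the relation (12.2) and its exceptions can be checked by brute force. The sketch below is illustrative only and is not part of the original argument: the function names are hypothetical, the class number is obtained by counting reduced binary quadratic forms, and the values $G = 0, 24, 12$ for $\mu \equiv 7$ (mod 8), $\mu \equiv 3$ (mod 8), and the remaining square-free $\mu$ are the classical ones of Gauss, assumed here to be what the first block of the table records.

```python
from math import gcd, isqrt

def count_three_squares(n):
    """A_3(n): ordered, signed triples of rational integers with x^2+y^2+z^2 = n."""
    r = isqrt(n)
    count = 0
    for x in range(-r, r + 1):
        for y in range(-r, r + 1):
            rest = n - x * x - y * y
            if rest < 0:
                continue
            z = isqrt(rest)
            if z * z == rest:
                count += 1 if z == 0 else 2  # z and -z are tallied separately
    return count

def class_number(disc):
    """Count reduced primitive forms a x^2 + b xy + c y^2 of discriminant disc < 0."""
    h = 0
    a = 1
    while 3 * a * a <= -disc:
        for b in range(-a + 1, a + 1):
            num = b * b - disc
            if num % (4 * a):
                continue
            c = num // (4 * a)
            if c < a or (c == a and b < 0):
                continue
            if gcd(gcd(a, abs(b)), c) == 1:
                h += 1
        a += 1
    return h

def field_discriminant(mu):
    """Discriminant of the rational field generated by (-mu)^{1/2}, mu square-free."""
    return -mu if mu % 4 == 3 else -4 * mu

def factor_G(mu):
    """Assumed classical factor G for D = 1, by the residue class of mu mod 8."""
    if mu % 8 == 7:
        return 0
    return 24 if mu % 8 == 3 else 12

SAMPLE = [2, 5, 6, 7, 10, 11, 13, 14, 15, 17, 19, 21, 22, 35]
for mu in SAMPLE:
    print(mu, count_three_squares(mu),
          factor_G(mu) * class_number(field_discriminant(mu)))
```

For every $\mu$ in the sample the two printed numbers agree. For $\mu = 1$ and $\mu = 3$, where roots of unity other than $\pm 1$ appear, the brute-force counts are 6 and 8 rather than $12H$ and $24H$, in keeping with the exceptions discussed above.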




The basic formula (12.2) fails, however, when $D = 12$ (see * in the
table), and the formula (12.2) is an “approximation” valid only to the
extent described in the next section. Considering $D = 8$, we know for any
integers $a$, $b$ with $a > |b|\,8^{1/2}$, we can represent


(12.5) $a + b\,8^{1/2} = (x_1 + y_1 2^{1/2})^2 + (x_2 + y_2 2^{1/2})^2 + (x_3 + y_3 2^{1/2})^2$


in terms of integers $x_i$, $y_i$. Thus a theorem for three squares supersedes the
one for four squares in the earlier work [16]. The preponderance of numerical
data [8] makes it impossible, however, to reject the hypothesis that three
squares suffice for $D = 12$.
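
The three-square statement for $D = 8$ can be probed numerically for small arguments. The following is a minimal sketch, assuming that the left member of (12.5) is $a + b\,8^{1/2}$ with $a > |b|\,8^{1/2}$; the function names are hypothetical. Since $(x + y\,2^{1/2})^2 = (x^2 + 2y^2) + xy\,8^{1/2}$, a decomposition amounts to splitting the pair $(a, b)$ into three pairs of the form $(x^2 + 2y^2, xy)$.

```python
from collections import Counter
from math import isqrt

def square_parts(a):
    """Pairs (x^2 + 2y^2, x*y), one for each xi = x + y*2^{1/2} with x^2 + 2y^2 <= a."""
    parts = []
    for x in range(-isqrt(a), isqrt(a) + 1):
        for y in range(-isqrt(a // 2), isqrt(a // 2) + 1):
            p = x * x + 2 * y * y
            if p <= a:
                parts.append((p, x * y))
    return parts

def count_three_squares_sqrt2(a, b):
    """Number of ordered, signed decompositions of a + b*8^{1/2} into three squares."""
    parts = square_parts(a)
    tally = Counter(parts)          # how many xi give each pair (p, q)
    total = 0
    for p1, q1 in parts:
        for p2, q2 in parts:
            total += tally[(a - p1 - p2, b - q1 - q2)]
    return total

def totally_positive(a, b):
    """a + b*8^{1/2} is totally positive exactly when a > |b|*8^{1/2}."""
    return a > 0 and a * a > 8 * b * b

for a in range(1, 13):
    for b in range(-a, a + 1):
        if totally_positive(a, b):
            print(a, b, count_three_squares_sqrt2(a, b))
```

In this range every totally positive $a + b\,8^{1/2}$ receives a positive count. Note that the rational integer 7, which is not a sum of three rational squares, is represented here, for example as $2^2 + (2^{1/2})^2 + 1^2$.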


13. Remarks on method. There are three different methods for
establishing a formula of type (12.2). The first method is the method of Gauss
which is based on a direct connection between the enumeration of quadratic
forms and representations like (12.1). This method has been applied with
success only to the rational case, $D = 1$.

The other two methods involve the use of some kind of theta series $\Theta(\tau)$.
The function $\Theta^3(\tau)$ is, by some means or other, expanded into a “singular
series” $\Psi(\tau)$ to yield identity (12.3) through a comparison of coefficients,
as did Jacobi in his classic proof for $\Theta^4(\tau)$ in the rational field.

There are differences, however, in general methods for establishing
$\Theta^3 = \Psi$, the basic identity. The “direct” method was applied (to the $\Theta^3$
case) by Kronecker [4, p. 109] (see Mordell’s version [23]), but in the
rational case only ($D = 1$).

We shall see that, in the quadratic case, $\Theta^3$ and $\Psi$ satisfy a system of
linear functional equations (like that in § 16 below), which happens to make
for a one-parameter family of solutions. This method could have been used
for $D = 5$ by Maass; we shall use it for $D = 8$ (as done earlier [16] for $\Theta^4$).
Indeed, the identities in § 24 are almost identical to those that would emerge
for $D = 5$, so, in effect, an alternate proof of Maass’ result [12] is provided
by this paper.

The less direct but deeper method is due to Siegel [15] and is based on
the fact that the form $\xi_1^2 + \xi_2^2 + \xi_3^2$ in $R(D^{1/2})$ is the only equivalence class
in its genus. The method was carried out by Maass for $D = 1$ [21], and
$D = 5$ [12]; (the discovery of the role of the $\Theta$-function and class number
in modular functions is due to Hecke [17]).

It should be remarked that, however straightforward it may be, the
calculation of the singular series in §§ 14-21 is a formidable matter, although
the “proof proper” consists of §§ 22-24, where it is proved that $\Theta^3 - \Psi = 0$
for $D = 8$.




We shall see that when $D = 12$, $\Theta^3 - \Psi$ is a cusp-form not identically
zero, as evidenced by the failure of equation (12.2), or more specifically by
the failure at its source (20.16), below. The case $D = 12$, however, as the
case of four squares, appears to be within the scope of some kind of exact
formula for (say) $A_3(\mu\gamma^2)$. For example, preliminary calculations show
formula (12.2) to be correct when $D = 12$, if $\mu$ is a square-free rational
integer $> 1$ and relatively prime to 6.

A continuing study for three and four square representations when $D = 12$
is being undertaken with exact formulas as a goal.


Calculation of Singular Series. 


14. Notation. We continue the earlier notation with Greek letters
denoting algebraic integers, Roman letters denoting rational integers (except
for special function-theoretic symbols), and the generalized Norm $N$ and
Trace $S$ applicable to conjugates in $R(D^{1/2})$ and our two so-called “conjugate”
continuous variables $\tau$ and $\tau'$ (each complex).

We consider the two fields with given discriminant and generator:


(14.1) $D = 8$, $\eta = 2^{1/2}$; $D = 12$, $\eta = 1 + 3^{1/2}$.

They share the properties of being Euclidean, having the fundamental unit

(14.2) $\epsilon = 1 + \eta$

and having the ring of all algebraic integers equal to

(14.3) $\mathfrak{O} = [1, \epsilon] = [1, \eta]$,


(which is required in setting up generators of the modular group, as in § 2).
In either case,


(14.4) $(\eta^2) = (2)$, $N(\eta) = -2$,
(14.5) $\mathfrak{O}_2 = [1, 2\eta] = [1, D^{1/2}]$.
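
The elementary properties (14.2) and (14.4) are easy to confirm by exact arithmetic. The sketch below is only a check under the assumption that the generators are $\eta = 2^{1/2}$ for $D = 8$ and $\eta = 1 + 3^{1/2}$ for $D = 12$; the pair representation and the function names are illustrative.

```python
def mul(x, y, m):
    """Product of x = (a, b) and y = (c, d), read as a + b*m^{1/2} and c + d*m^{1/2}."""
    a, b = x
    c, d = y
    return (a * c + m * b * d, a * d + b * c)

def norm(x, m):
    """Norm of a + b*m^{1/2}."""
    a, b = x
    return a * a - m * b * b

# D -> (m, eta) with D = 4m; the generators eta are the assumed reading of (14.1).
FIELDS = {8: (2, (0, 1)), 12: (3, (1, 1))}

def check_field(D):
    m, eta = FIELDS[D]
    eps = (1 + eta[0], eta[1])                   # (14.2): eps = 1 + eta
    eta_sq = mul(eta, eta, m)
    even = eta_sq[0] % 2 == 0 and eta_sq[1] % 2 == 0
    half = (eta_sq[0] // 2, eta_sq[1] // 2)      # eta^2 / 2, meaningful when `even`
    return {
        "N(eta)": norm(eta, m),
        "N(eps)": norm(eps, m),
        "2 divides eta^2": even,
        "N(eta^2/2)": norm(half, m),             # +1 or -1 means (eta^2) = (2)
    }

for D in (8, 12):
    print(D, check_field(D))
```

In both fields $N(\eta) = -2$, the quotient $\eta^2/2$ is a unit, and $\epsilon = 1 + \eta$ has norm $\mp 1$.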
We introduce the simplifying symbol


(14.6) $e(\rho) = \exp \pi i(\rho - \rho')/D^{1/2}$.


In particular,


(14.7) $e([r + s\eta]/t) = \exp \pi i s/t$;



thus for the algebraic integer $\rho$ (in $\mathfrak{O}$)
(14.8) $e(\rho) = 1$ if and only if $\rho \in \mathfrak{O}_2$.
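
Property (14.8) can be illustrated numerically. The sketch below assumes the reading $e(\rho) = \exp \pi i(\rho - \rho')/D^{1/2}$ of (14.6), with $\eta = 2^{1/2}$ or $1 + 3^{1/2}$; under that assumption an integer $\rho = r + s\eta$ lies in $\mathfrak{O}_2 = [1, 2\eta]$ exactly when $s$ is even. The helper names are hypothetical.

```python
import cmath
import math

# D -> (eta, conjugate of eta); assumed generators for D = 8 and D = 12.
ETA = {8: (math.sqrt(2), -math.sqrt(2)),
       12: (1 + math.sqrt(3), 1 - math.sqrt(3))}

def e(r, s, D, t=1):
    """e([r + s*eta]/t) under the assumed definition exp(pi*i*(rho - rho')/D^{1/2})."""
    eta, eta_conj = ETA[D]
    rho = (r + s * eta) / t
    rho_conj = (r + s * eta_conj) / t
    return cmath.exp(1j * math.pi * (rho - rho_conj) / math.sqrt(D))

def in_O2(r, s):
    """r + s*eta belongs to O_2 = [1, 2*eta] exactly when s is even."""
    return s % 2 == 0

for D in (8, 12):
    for r in range(-2, 3):
        for s in range(-2, 3):
            value = e(r, s, D)
            print(D, r, s, round(value.real, 6), in_O2(r, s))
```

The printed real part is $+1$ when $s$ is even and $-1$ when $s$ is odd, independently of $r$.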


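Finally, using the complex variables in the argument of $e(\rho)$, we redefine
the earlier $\Theta_{d,c}(\tau)$ as follows:


(14.9) $\Theta(d,c;\tau) = \sum e(\nu d\eta + [\nu + c\eta/2]^2\tau)$


summed over all $\nu$ in $\mathfrak{O}$, where


(14.10) $\operatorname{Im} \tau > 0$, $\operatorname{Im} \tau' < 0$.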


The following transformation laws are determined from formulas:
(14.11) $\Theta(d+2,c;\tau) = \Theta(d,c+2;\tau) = \Theta(d,c;\tau)$,

(14.12) $\Theta(d,c;\epsilon^2\tau) = \Theta(d,c;\tau)$,

(14. 13) @(d,c;7+7) =O(d-+ 1, 

(14. 14) 

(14.15) @(d,c;—1/r) 

for the principal square root of N(r). 


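15. Asymptotic relations. To calculate the singular series, we consider
$\Theta(d,c;\tau)$ near $\tau = \alpha/\beta$, ($\tau' = \alpha'/\beta'$), an irreducible algebraic rational in
$R(D^{1/2})$:

(15.1) $\Theta(d,c;\alpha/\beta + \lambda) = \sum e(\nu d\eta + [\nu + c\eta/2]^2\alpha/\beta)\, e([\nu + c\eta/2]^2\lambda)$,


where $\lambda$, $\lambda'$ are a new pair of complex variables satisfying condition (14.10).
Then, writing
(15.2) $\nu = \nu_1 + 2\beta\nu_2$, ($\nu_1$ mod $2\beta$, $\nu_2$ arbitrary),
we find
(15.3) $\Theta(d,c;\alpha/\beta + \lambda) = \sum_{\nu_1} e(\nu_1 d\eta + [\nu_1 + c\eta/2]^2\alpha/\beta) \sum_{\nu_2} e([\nu_1 + 2\beta\nu_2 + c\eta/2]^2\lambda)$.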


But the inner sum, according to the general method, is asymptotically
independent of $\nu_1$ and $c$; thus


(15. 4) ~ 0, 487A) = @(0, 0, )/N 




according to formula (14.9). Finally, 


(15. 5) @(d,c3a/B +A) = Gae(a/B)/4|N(B)| N 
where 


At this point, we examine the Gaussian-type sum $G_{d,c}(\alpha/\beta)$ more critically
and note a new type


(15.7) $H_{d,c}(\alpha/\beta) = \sum_{\nu \bmod \beta} e(\nu d\eta + [\nu + c\eta/2]^2\alpha/\beta)$
when
(15.8) $(\alpha + d\eta)(\beta + c\eta) \in \mathfrak{O}_2$.


Such a sum is characteristic of the cases $D = 8, 12$. In the work of Maass
[22, p. 716], no condition like (15.8) arises. Indeed, unless (15.8) is valid,
the residues (mod $2\beta$) $\nu$ and $\nu + \beta$ make cancelling contributions to the sum
(15.7). Thus, calling $H_{d,c}(\alpha/\beta) = 0$ when (15.8) fails, we have


(15.9) $G_{d,c}(\alpha/\beta) = 4H_{d,c}(\alpha/\beta)$.
Finally, changing variables, we say, for $\tau$, $\tau'$ near $\alpha/\beta$, $\alpha'/\beta'$,


Ha,.(a/8)sgn N(8)/N(B)AN (Br — (if 1/0); 
(if a/B 1/0). 


(The last item was inserted for completeness. Of course $\tau$, $\tau'$ approach
infinity consistently with (14.10), and we use the conventional sign and
delta function.) The square root will be principal, hence consistent with
$\tau \to 0$, $\tau' \to 0$ along the positive and negative imaginary axes according to
(14.10).

If we compare the asymptotic behaviors as applied to (14.11), (14.12), 
(14.13), (14.14), we find 


(15.10) @(d,c;7) = 


(15.11) = = 
(15. 12) Ha = 

(15. 13) Ha,e(%/B + 4) = 
(15.14) Hae(%/B + 1) = Haree(%/B), 


which incidentally could be verified from (15.7). A more difficult result is 
the reciprocity: 




(15.15) (Ha,(— B/«)/N 


Here 


(15. 16) 
—+ 
The reciprocity relation (15.16) is proved by the Cauchy-Hecke method 


of comparing (15.10) with 
(15.17) H.a(—B/a)sgn N (a)/N(a)§N (ac + 


using (14.15) under the comparison or ——1. The details are very similar 
to Hecke [18] except that we permit $N(\alpha)^{1/2}$ to be imaginary, thereby forcing
the “balancing” factor, sgn$(\alpha, \beta)$, to be real.

By virtue of identities (15.11) through (15.14) we can confine all our
attention to $H_{0,0}(\alpha/\beta)$, henceforth abbreviated:

(15.18) $H_{0,0}(\alpha/\beta) = H(\alpha/\beta)$.

Unless

(15.19) $\alpha\beta \in \mathfrak{O}_2$,

we note $H(\alpha/\beta)$ is trivially taken as 0.

16. Hecke’s modular function. The superposition of the elements
(15.10) for $\Theta^k(d,c;\tau)$ does not create an absolutely convergent series until
$k > 4$. To remedy this fact, Hecke created the series with convergence factor:


(16.1) (d,c;7;k,s) 

80,6 -+ N(B)/N(p)*?N (Br — a)*/? | N(Br—a)|*. 
Our interest shall center about $k = 3$, but the general integer $k$ is
momentarily useful. This series converges when


(16.2) $k/2 + \operatorname{Re} s > 2$.


(We shall be spared the tedium of repeating the convergence majorants by
virtue of the similarity with estimates valid for $D = 5$; see [21].)

By virtue of the relations (15.11) through (15.15) we find, omitting 
symbols k, s when convenient, 


(16.3) $\Psi(d,c;\tau) = \Psi(d+2,c;\tau) = \Psi(d,c+2;\tau)$,


sen(a, 8) =—1 with the following arrays: 
ao B 




(16. 4) (d,c;er) =W(d,c;r), 

(16.5) W(d,c;7-+ 7) (d,c;7) 

(16. 6) W(d,c;r-+1) (d+ 

(16. 7) W(d,c;—1/r) =W (c,d; 1) e*(cdy?/2)N(r)*/? | 


Hecke’s work [7] shows that the function $\Psi$ has no finite singularities in $s$,
hence by analytic continuation we can let $s \to 0$. In so doing, we obtain a
function $\Psi$ satisfying the same functional equations as $\Theta^3$. The significance
of this fact we leave until later on. Most important for computational purposes
is the fact that the series (16.1) is susceptible to a fortuitous arrangement
before making $s \to 0$. Again we spare the reader the details on majorants,
by referring to the similar case where $D = 5$.

It suffices to restrict ourselves to the case $d = 0$, $c = 0$ and simply write


(16. 8) = 


17. Rearrangement of series. We first write 


aB EDs 
where 


(17. 2) ®(a/B,7) + | N(r—a/B+ 2v) 


(Note that the sgn* N(8) of (16.1) disappears when N (8) is divided out.) 
By making use of the Poisson-Lipschitz formula, again, we find 


(17.3) (2/B, 7) 
— (1/404) (P—a/p | N(P—a/B 


where $P$ and $P'$ are real variables and $\mu$ is summed over $\mathfrak{O}$. Introducing,
with Hecke,


(17.4) B(p,7r) f + 7)*/? | N(P +7) 


we find the singular series 


(17.5) V(r) =1+ (B(u, 7) /2D8) P(u), 
where 
(17.6) wa/B)/2 | N(B)|*. 


aBEDs 




Now we know, in advance, that only totally positive $\mu$ can be present in $\Theta^3$,
(although other $\mu$ are present before we set $s = 0$). We can see more clearly
that all $\mu$ entering into (17.5) are in $\mathfrak{O}_2$, for, by (15.14),


(17.7) $H(\alpha/\beta + 1) = H(\alpha/\beta)$;
but $e(-\mu[\alpha/\beta + 1]) = -e(-\mu\alpha/\beta)$ if $\mu \notin \mathfrak{O}_2$.


We next introduce the symbol 


(17.8) P(B,n) 
aBpeD, 


aware that H(a*/B*) =0 if a*@*¢O,. Then we find 
(17.9) P(u) == N (8) |**. 


The remarkable simplification now achieved is due entirely to the fact 
that if (81,82) =1, 


(17.10) H («/B1B2) =H for special a, 
(17.11) P (Bi, =P (BiB2, 


A corresponding result was proved by Maass [22; p. 736] using a Lemma of
Siegel. Actually, a direct proof (like the one in the case of elementary
Gaussian sums [20]) can work if we strictly observe that we must have
$\alpha_i\beta_i \in \mathfrak{O}_2$, in our choice of $\alpha_1$ and $\alpha_2$ in (17.10).

Thus if we list the prime divisors in $R(D^{1/2})$, denoted by $\pi$, we find, by
unique factorization,


(17. 12) P(u) HP 


where, 
(17. 13) Py(u) = 1+ EP (p',u)/| 
On the other hand, as $s \to 0$, when $\mu \gg 0$,
(17.14) B(p, 7) (6/2) ] e(ur) 


by use of the $\Gamma$-integral. Thus, making use of uniform convergence majorants
[21] as $s \to 0$, we find, when $k = 3$,


= 2 e(ur)Z(u), 


(17. 15) 




We have next to show $\Theta^3 = \Psi$ and then to simplify $P(\mu)$ to the point
where it is recognizable. We do the latter first.


Quadratic Residue Properties. 


18. Basic formulas. It is apparent, by now, that we can deduce most
of the results on $H(\alpha/\beta)$ by analogy with Gaussian sums. We can, of course,
restrict ourselves to $\beta$ a prime power.

First we take $\pi$ an odd prime in $R(D^{1/2})$, $(\pi) \ne (\eta)$. We note $\pi = q$ if
$(D/q) = -1$, for $q$ a rational prime, and $\pi\pi' = \pm p$ if $(D/p) = 1$, for $p$ a
rational prime. We now standardize $\pi$ by the condition $\pi \in \mathfrak{O}_2$, at least to
within the factor $\pm\epsilon^{2w}$ ($w$ integral), (by choosing $\epsilon\pi$ if $\pi \notin \mathfrak{O}_2$). Thus,
for $D = 8$, $\pi = 3, 5, 1 + 2\cdot 2^{1/2}, 11, 13, 5 + 2\cdot 2^{1/2}, \cdots$, etc. For $D = 12$,
$\pi = 3 + 2\cdot 3^{1/2}, 5, 7, 1 + 2\cdot 3^{1/2}, 5 + 2\cdot 3^{1/2}$, etc.
(Note, $(3 - 2\cdot 3^{1/2}) = -(3 + 2\cdot 3^{1/2})\epsilon^{-2}$.)
The primes, of course, will not necessarily have positive norm.


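Next we observe a “polygon” theorem:


(18.1) $\sum_{\zeta \bmod \beta} e(2\zeta\alpha/\beta) = |N(\beta)|$ if $\alpha/\beta$ is an integer, $0$ otherwise.

The proof consists of showing that if $\alpha/\beta$ is no integer, the sum is unchanged by
multiplication by $e(2\zeta_0\alpha/\beta)$, for a properly chosen $\zeta_0$. Using this,
we consider


(18.2) $H(\alpha/\pi^t) = \sum_{\nu \bmod \pi^t} e(\nu^2\alpha/\pi^t)$, $\alpha \in \mathfrak{O}_2$.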


We set + v2, (v, mod z,vmod z**), and we discover 


H(a/rt)= e(ve2a/rt) S 


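(18.3) $H(\alpha/\pi^t) = |N(\pi)|\,H(\alpha/\pi^{t-2})$.


Since $H(\alpha) = 1$, trivially, we concentrate on $H(\alpha/\pi)$.
We next distinguish the quadratic residues mod $\pi$ by introducing the
symbol

(18.4) $(\lambda/\pi) = 1$ if $\lambda \equiv \xi^2$ mod $\pi$, $(\lambda, \pi) = 1$,
       $(\lambda/\pi) = -1$ if $\lambda \not\equiv \xi^2$ mod $\pi$, $(\lambda, \pi) = 1$,
       $(\lambda/\pi) = 0$ if $(\lambda, \pi) \ne 1$.


Then if we consider $\rho$ the residues mod $\pi$, $(\rho/\pi) = 1$, and $\nu$ the non-residues
mod $\pi$, $(\nu/\pi) = -1$, we form


(18.5) $R = \sum e(\rho/\pi)$, $N = \sum e(\nu/\pi)$,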




since we can easily make $\rho \in \mathfrak{O}_2$ (by adding $\pi\epsilon$). Clearly
(18.6) $H(1/\pi) = 1 + 2R$,
(18.7) $0 = 1 + R + N$,


the latter following if we select $\rho$ and $\nu$ from the complete set of residues $2\zeta$,
($\zeta$ varying mod $\pi$), using (18.1). Thus, as in the Gaussian case,


(18.8) $H(1/\pi) = R - N = \sum (\alpha/\pi)e(\alpha/\pi)$,
(18. 9) H(a/m) = (a/r)H(1/m), if 

We now need to evaluate only H(1/). By quadratic reciprocity (15. 15); 
(18.10) (1/r)/N (—z/1)/1—=sgn (1,7). 
Hence, for odd t, 


while, for even 


(18. 11b) H(a/mt) =| 


Next take the even prime, $\eta$. For $\beta = \eta^t$, we find that the expression
$H(\alpha/\beta)$ makes sense, with $(\alpha, \beta) = 1$, exactly when $t \ge 2$. Even so, using
the polygon theorem (18.1), we find by an argument similar to the one in
the last section,

(18.12) $H(\alpha/\eta^t) = 2H(\alpha/\eta^{t-2})$ if $t \ge 4$. Here $\alpha$ need not belong
to $\mathfrak{O}_2$. Thus every value of $H(\alpha/\eta^t)$ can be computed directly from these:


(H (1/42) = 2; H(1/y%) = 2- 


24 
H((1 2) E 3) 
(18.18) ((1-+ (3/y°) = 2 2h, 


— 


In cases where two values are bracketed, the upper one belongs to
$D = 8$ and the lower one belongs to $D = 12$.
We also note the further results:


(18. 14) t24, 
(18. 15) H (a/n? +») =H 
(18. 16) H +4) =— H (a/y°). 




The first follows from the observation that in the sum (15.18) for
$H(\alpha/\eta^t)$ only the even $\nu$ enter; the odd $\nu$ cancel out to make possible (18.12).
By formula (17.10) we can show, finally,


(18.17) $|H(\alpha/\beta)| = |N(\beta)|^{1/2}$
if $\alpha\beta \in \mathfrak{O}_2$ and $\alpha/\beta$ is reduced.


19. Quadratic reciprocity. The epitome of the theory is the deduction
of quadratic reciprocity by the method of Hecke. Our principal variation
consists in the restriction in $H(\alpha/\beta)$.

If $\pi_1$ and $\pi_2$ are odd primes in $\mathfrak{O}_2$, by using (15.15), (18.9), and (18.10):


(19.1) $(\pi_1/\pi_2)(\pi_2/\pi_1) = \{\pi_1, \pi_2\}$,
where $\{\pi_1, \pi_2\} = \operatorname{sgn}(1,\pi_1)\operatorname{sgn}(1,\pi_2)\operatorname{sgn}(\pi_1,\pi_2)\operatorname{sgn} N(\pi_1)$, or
$\{\pi_1, \pi_2\} = -1$ under the array of signs:
+ =, mm.>>0;N <0,N (a2) <0; 
— —, <<0;N(m) <0; 
+ =, m<<0;N (me) <0; 
$\{\pi_1, \pi_2\} = +1$, otherwise.


Then the first completion theorem also emerges in the process:
(19.3) $(-1/\pi) = \operatorname{sgn} N(\pi)$.


We can deduce the rational results: If $\pm p$ denotes $\pi\pi'$, $p > 0$, then
$\xi^2 + 1 \equiv 0$ mod $\pi$ is solvable exactly when $x^2 + 1 \equiv 0$ mod $p$ is solvable. This
implies, in rational symbols, $(D/p) = (-1/p) = +1$, hence $p \equiv 1$ mod $D$
for $D = 8$ and $12$. Thus the elementary theory of quadratic forms yields the
following expression, for an arbitrary $p \equiv 1$ (mod 8):


(19.4) $p = u_1^2 - 8v_1^2 = N(\pi_1)$, $\pi_1 = u_1 + 2\cdot 2^{1/2}v_1$,
and one (not both) of the following for $p \equiv 1$ (mod 12):


(19.5) $p = u_2^2 - 12v_2^2 = N(\pi_2)$, $\pi_2 = u_2 + 2\cdot 3^{1/2}v_2$,
       $p = 4u_3^2 - 3v_3^2 = N(\pi_3)$, $\pi_3 = 2u_3 + 3^{1/2}v_3$.

Clearly, $\pi_1$, $\pi_2$, and $(2 + 3^{1/2})\pi_3$ all $\in \mathfrak{O}_2$, verifying (19.3). Conditions (19.4)
and (19.5) are clearly necessary and sufficient for $(-1/\pi) = 1$, when $\pi$ is
not a rational prime.
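
The representation (19.4) is easily exhibited for small primes. The sketch below is a plain brute-force search, not the method of the text, and `represent` is a hypothetical helper; it looks for $p = u^2 - mv^2$ with $m = 8$ for primes $p \equiv 1$ (mod 8), and, under the assumption that the first line of (19.5) reads $p = u_2^2 - 12v_2^2$, with $m = 12$ for primes $p \equiv 1$ (mod 12).

```python
from math import isqrt

def represent(p, m):
    """Return (u, v) with u*u - m*v*v == p and the least v >= 0, or None if v < p fails."""
    for v in range(p):
        target = p + m * v * v
        u = isqrt(target)
        if u * u == target:
            return u, v
    return None

PRIMES_1_MOD_8 = [17, 41, 73, 89, 97, 113, 137, 193]
PRIMES_1_MOD_12 = [13, 37, 61, 73, 97, 109, 157, 181, 193]

for p in PRIMES_1_MOD_8:
    print(p, "= u^2 - 8 v^2 with (u, v) =", represent(p, 8))
for p in PRIMES_1_MOD_12:
    print(p, "= u^2 - 12 v^2 with (u, v) =", represent(p, 12))
```

For instance $17 = 5^2 - 8\cdot 1^2$, so that $\pi_1 = 5 + 2\cdot 2^{1/2}$, which is one of the standardized primes listed in § 18.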
A less simple case is the second completion theorem. We apply the 
reciprocity formula (15.15) to H(»3/—-7) and obtain a set of values of 




(n/—7) as related to H(+ which is determined by z (mod 2y*), (not 
mod 7’, by (18.16)). Since the steps are finite in number, we omit tedious 
details : 


(19. 6) =sgn(1,—)sgn (9°, — 2) H 28, D2). 
Thus from formulas (18.13) and (18.16) for D=8 or 12: 


(19.7 +sgn7’ if r=1,1-+ 7°, 3,3 +7 (mod 
—sonn’ if r=5,5+ 7°, 7,7 + 7° (mod 4). 


The final completion law is found by doing likewise with H (en*/—7) 
and finding (en/r). Omitting details, we note 


(19.8) (en/w) = sgn (1, sgn — 7) H /2 24[— N 
(rE Do). 


Thus, as before, with « defined in (14.2), we find the remarkable result for 
D 8, D>, 


(19.9) (e/r) if r=1,3+ (mod 4), 
(19. 10) (e/r) =—senw’ if r=1-+ 27,3 (mod 4), 
and for D—12, r€ Do, 

(19.11) (e/r) =1 if r=1,3 (mod 4), 


We note that lines (19.10) and (19.11) represent the odd residue classes for 
which r==— mod 4. The pattern emerges again for the table in § 12 (above). 


20. The zeta series. We next consider the various series $P_\pi(\mu)$, starting
with $P(\pi^t, \mu)$, of § 17.
If $\pi$ is odd, in expression (17.8) for $P(\pi^t, \mu)$ we note if $\alpha$ and $\pi \in \mathfrak{O}_2$,
then we have simply (for $k = 3$),
P(xt,u) t21) 
(20. 1) 
| W(x) |*/?[sgn* N (x) ]*/#sgn‘ (1, 2 _, 
a@mod 


The inner sum is recognizable again as a character sum, not unlike (18.8),
except for the fact that $\pi$ could divide $\mu$. Let us write


(20. 2) = 1, (m=0), 




then we find after an elaborate effort: 
{ 0 t odd 


(2.3) P(m',n) =4 t odd 
— | N(x) ¢ even m+1, 


0 t>m+1. 


The interesting cases are where $\mu$ is square-free so $m = 0$ or $1$. In either
case, following a familiar pattern [12, p. 190]


(20.4) =(1—| N@) |), 


The contribution of $\eta$ is more complicated. First of all, the analogue of
(20.1) is the more complex expression


P(t,n) 
(20. 5) 


hs 


+ H*(a/n* + 9) wa/n’ — yn). 
We therefore see that if » is odd, wE€ Ds, by (18.14), P(n*,n) =0 if t24. 
The problem is then finitary and details can be omitted. We introduce the 
characters 
(Clearly Q=—1 necessarily if »¢ D2). Since w>>0, this character is 
consistent with (19.9), etc. (if *——vyp formally). We introduce the 


further character, 
1 if —p=£ (mod 4y), » odd; 

(20.7) Q(—yp»,7) = 3—1 if (mod 4y) but —p= (mod 4), odd; 
0 if mod 4, or (p, 7) £1. 


The characters Q(—p,«) and Q(—vp,7) are like the rational characters 


(—1/p) and (p/2). 
Now we can verify that if » is square-free, and w>> 0, we Do, 


(20. 8) = 

(20. 9) P (n°, w) = 1289 

so that for »€ Dz, square-free, totally positive, Py(u) —Po(u), where 
(20. 10) Po(u) 2. 


We more generally consider py? defined according to (12.3) to cover cases 
where »¢ D2. Then it can be seen 




(20. 11) Py (uy*) = Po(u). 
We consider terms of the special type with coefficient defined as shown: 
(20. 12) V(r) = De(uy’r)Z (uy?) +: 


We then make a series of substitutions to obtain a convenient formula for
$Z(\mu\gamma^2)$. The details are lengthy, but we might just dwell on properties of
the field $R(D^{1/2}, (-\mu)^{1/2})$ as a relative quadratic field over $R(D^{1/2})$. Thus the
zeta function
(20.18) J[[(1—| = 162*C/D*/?, = 1/48, = 1/24), 
from § 5. We also note the relative zeta function [18] for the adjunction of
$(-\mu)^{1/2}$ to $R(D^{1/2})$ is
II (1— (1 — Q(— 9) /2)* 
(20.14) todd 
H (D4, —y)4) (2n°/w) / (log | |/log | |) (4/D)}, 


where $\Delta$ is the discriminant of $R(D^{1/2}, (-\mu)^{1/2})$, $w$ is the number of complex
roots of unity and $E$ is the fundamental unit of this larger field. We can
now write, if we let $s \to 0$, in the terminology of (12.2),


(20.15) =H (DA, 


where, for square-free, totally positive p, 


(20.16) 


log | € | 
2\ 7)2)\3 
Cp(A/N (py )D ) log | E | 


This is substantially the $G$ of (12.2), if we find $\Delta$ according to the usual
theory of relative-quadratic fields. Referring to (20.6), the relative basis is
(for $D = 8, 12$),


(1, (—#)*)/y], if =—1, pE Ds, 
[1, (—a)4], if D>. 
Of course, in the first two cases $\gamma = 1$, while in the third $|N(\gamma)| = 2$. To
complete the explanation, we might add that $w = 2$ and $|\epsilon| = |E|$ except in
cases cited in (12.4), which we exclude.

As a concluding remark to this section we observe that the progress from 
formulae (17. 12-17.15) to (20.15) depends heavily on Gaussian sum manipu- 
lations. Here many details were not needed, particularly those typified by the 


(€+ (—)4)/2], if €) = 1, 
(20.17) 




reduction of such sums to “normal form” and the use of the “ zero-rule,” 


as, for example, in Hasse’s work [16a; p. 14]. The zeta-function manipu- 
lations, however, are still best derived as ad hoc extensions of Maass’ work 
[12]; while the identification of the modular forms will be done by a more 
direct method, starting in § 22 below. 


21. Singular series for four squares. Before proceeding to the proof
that $\Theta^3 = \Psi$, we note in retrospect how the singular series could be developed
for the fourth power. In the earlier paper [16], formula (51a) yields,
essentially, for $D = 8, 12$, that the singular series for $\Theta_{0,0}^4(\tau)$ is


(21.1) Qoo(r) = {441 (7) + 6440(47) + 440(7) —24A0(2r) +1 
where 
Ax(r) =Lim [NW (vr + 4) | +2) | 
(21. 2) 2), 
(v) A (0), 


and 
(21.3) Cp’ = 


It is no great difficulty to evaluate the singular series (16.1) for $k = 4$
and obtain the matching series


(21.4) W(0,0;7;4,s)=1+ > 
a/B,aB EDs 


This can best be done using formulas (17.10), (18.11), (18.13), which
assure us


(21.5) $H^4(\alpha/\beta) = N(\beta)^2$.
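
Relation (21.5) lends itself to a direct numerical check in the field of $2^{1/2}$. The sketch below is illustrative and rests on two assumptions: that $e(\rho) = \exp \pi i(\rho - \rho')/8^{1/2}$, which for $\rho = x + y\,2^{1/2}$ equals $\exp \pi i y$, and that $H(\alpha/\beta) = \sum_{\nu \bmod \beta} e(\nu^2\alpha/\beta)$. The residue systems are written out by hand for the inert prime 3 and for the prime $1 + 2\cdot 2^{1/2}$ of norm $-7$; all names are hypothetical.

```python
import cmath
import math
from fractions import Fraction

# Elements of the field of 2^{1/2} are pairs (x, y) standing for x + y*2^{1/2}.

def mul(a, b):
    return (a[0] * b[0] + 2 * a[1] * b[1], a[0] * b[1] + a[1] * b[0])

def norm(a):
    return a[0] * a[0] - 2 * a[1] * a[1]

def inverse(a):
    n = norm(a)
    return (Fraction(a[0], n), Fraction(-a[1], n))

def e(rho):
    """Assumed symbol: exp(pi*i*(rho - rho')/8^{1/2}) = exp(pi*i*y) for rho = (x, y)."""
    return cmath.exp(1j * math.pi * float(rho[1]))

def gauss_sum(alpha, beta, residues):
    """H(alpha/beta) for d = c = 0, summed over the given residue system mod beta."""
    quotient = mul(alpha, inverse(beta))
    return sum(e(mul(mul(nu, nu), quotient)) for nu in residues)

# beta = 3 is inert: residues r + s*2^{1/2} with 0 <= r, s < 3.
H3 = gauss_sum((1, 0), (3, 0), [(r, s) for r in range(3) for s in range(3)])

# beta = 1 + 2*2^{1/2} has norm -7: residues 0, 1, ..., 6.
H7 = gauss_sum((1, 0), (1, 2), [(n, 0) for n in range(7)])

print(H3, H3 ** 4, norm((3, 0)) ** 2)
print(H7, H7 ** 4, norm((1, 2)) ** 2)
```

The first sum is 3 and the second reduces to the classical Gaussian sum $\sum \exp(2\pi i n^2/7) = i\,7^{1/2}$, so that the fourth powers are 81 and 49, the squares of the respective norms.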


We pause here to note that the summation condition “$\alpha\beta \in \mathfrak{O}_2$ for $\alpha/\beta$
reduced” actually encompasses the rather complicated coefficients in (21.1).
In fact, examining Götzky’s work equally critically we note, for $D = 5$,


(21.6) [Se(v?r) ]* = {84o(r) + 1284,(4r) —16A,(27) + 1. 


Again, this fairly complicated expression of Götzky could have been written
as (21.4), ($s \to 0$), with the restriction that $\alpha/\beta$ be reduced and $\alpha$ and $\beta$ not be
both odd. (The last restriction, of course, enters Jacobi’s well-known
demonstration involving four rational squares.)




Equality of Theta Series and Singular Series. 


22. Modular forms. The functions $\Psi(d,c;\tau;3,0)$ and $\Theta^3(d,c;\tau)$
satisfy the same functional equations (14.11)-(14.15) and (16.3)-(16.7).
Are they proportional?

We revert to the procedure of § 7. We note that $\Theta(d,c;\tau)$ can vanish,
within its functional domain, only on a well defined zero-manifold. This
manifold exists only for $d = c = 1$ and is the complex (two dimensional)
manifold


(22.1) $v = 0$


in new complex coordinates $(u, v)$ defined as follows:


22.2 
( ) 


T=u+2, 
(22.3) =—u-+v, for D=12. 


Furthermore, on this manifold, the zeros are simple (see § 7), or,
(22.4) $\partial\Theta(u,v)/\partial v \ne 0$ for all $u$ (at $v = 0$).


We need only show that $\Psi(d,c;\tau;3,0)/\Theta^3(d,c;\tau)$ has no poles in the
fundamental domain by establishing a third order zero for $\Psi$ when $d = c = 1$
and $v = 0$. We can then draw our conclusion (that $\Psi$ and $\Theta^3$ are proportional)
from Theorem 4 (§ 2).

We define $\Xi(d,c;\tau)$ as a system of four modular functions (as $d$, $c$ vary)
in $\tau$ and $\tau'$, having no finite singularities and satisfying the system (16.3)-
(16.7) with $k = 3$ (dimension $-3/2$), and $s = 0$. Then we conclude


(1,1,—¢?/r?) = E(1,1,7) 
22.5 > 
2(1,1,7—¢) =— 1,7), for D—=8; 


1,—1/r = = (1, 1,7) N(r)*/?, 


22. 
( £(1,1,7-+7—1) =— 1,r), for D=12. 


We next define 


Then let j denote the integer = 0, for which 
(22.8) Ty (wu) (u) =: -=T;(u) =0; Ty (u) 
Then since V(r) —u?, on the zero manifold while 0/dv = + 0/dr’, etc., 


for t <j, 




(22.9) T;(—1/u) =— 11; (wu) 
T;(w) > 0, as for D=8; 
(u + 3#) =—T;(u), 
(22.10) 1/u) =— 1; (uw) 
T;(u) > 0, as u—>ico, for D—12. 


If we could show $j \ge 3$, we shall then have established that $\Psi = \Theta^3$.
We can do this only for $D = 8$ (and with an additional bit of special
information, distinguishing this case from $D = 12$, as we now develop).


23. Special Fourier expansions. We first shall estimate the order of
magnitude of $\Gamma_j(u)$ as $u \to i\infty$. To do this we expand $\Psi(1,1;\tau;k,s)$ by the
Poisson-Lipschitz formula as in § 17. The only difference is that, whereas
earlier $H(\alpha/\beta) = H(\alpha/\beta + 2\nu)$, now we must use


(23.1) $H_{1,1}(\alpha/\beta + 2\nu) = e(\nu\eta^2/2)\,H_{1,1}(\alpha/\beta)$.


Thus we must insert the factor $e(\nu\eta^2/2)$ into the numerator of the fraction in
the summation of (17.2). This produces the result (in contrast to (17.15)),


(23. 2) = e(u*r)8x°P* (nu) D-*)8, 

where 

(23.3) pe Ds, 

and 


where $H_{1,1}$ has meaning, of course, only when


(23.5) $(\alpha + \eta)(\beta + \eta) \in \mathfrak{O}_2$.


We now have to separate the cases: When D = 8, we write, in the notation 
(22. 2), 


(23. 6) 
> [N (u*) riu(a—b + 4) -expribv: P*(y*), 
with the restriction 
(23. 7) =a-+ b24+ 4 550. 
When D=12, we write in the notation (22.3) 
(23. 8) 
— (2x*/3) (u*) exp riu(a + b)/34- exp + P*(u*) 




with the restriction 


(23.9) (a+b) + 0. 


Now if we are interested in the behavior of I(u,v) in u, (as u—->10), 
we must arrange y* in order of increasing values of a—b+4 when D=8 
and a+b when D—12. Thus, for D=8, the first few terms are 


= 1/2, 3/2 + 2853/2, 5/2 + 24, 7/2 4+2-28,9/2 43-28; 
and 
(1,1375;3,s) exp(miu/2) [P*(4) + exp wiv: + 23)] 
+ exp (3riu/2) [8/2P* (3/2) + 178/2P* (5/2 + 2 )exp riv 
4. 178/2P* (7/2 + 2-2h)exp wiv + 3/2P* (9/2 + 3-24) exp 3ziv] 
For D = 12, the first few terms are 


(23. 10) 


p* 1 + 34/2, 1 — 34/2; 2 + 38/2, 2 —34/2;-- 


and 
PAC + 3/2) + exp(—niv/2) —3/2)] 
+ 134/2 exp(2miu/3#)[P*(2 + 33/2) + exp(— niv/2)P*(2 — 34/2) ] 
We next observe that for D = 8, 
(23.12) P* (yp) =0 if a=b mod?2. 


This is true because of a symmetry in the formula (23.4). For instance, if 

has the property (23.5), so does «4/8 + (1+ 7). Now when is 

augmented by y, is multiplied by 
=—e([a+}]y), 


which equals — 1 when a=bmod2. Thus all terms of type exp ri(4s —1)u/2 
vanish in (23.10), and, in particular, differentiating with respect to v (at 
'=0), we find, condition (22.9) supplemented by the strong result 


(23. 13) T;(u) = O(exp 3riu/2) as u> io, 


if we start by letting == W(d,c;7;3,0) in equation (22.7). 
Turning our attention to D = 12, we are less fortunate; 


(23.14) P*(u*) =0, if mod 2, 




since becomes multiplied by + e([a-+b]y) when 
a/B is augmented by 1+ 7. Thus all terms of type exp 2¢riu/3* vanish in 
(23.11). Thus (22.10) can be supplemented only by the comparatively
weak result


(23. 15) = O (exp wiu/34), 


if we start with $\Xi = \Psi$ in equation (22.7).


24. Completion of the identity for D = 8. We first take the case $D = 8$
where the rational modular group has the famous Klein invariant $J(u)$.
The factors $-i$ in system (22.9) serve to forecast the presence of radicals.
To simplify matters, we take fourth powers:


(24. 1a) 
(24. 1b) (uw) 
(24.1c) $\Gamma_t^4(u) = O[J^{-3}(u)]$ as $u \to i\infty$,


the last expression coming from (23.13). Thus
(24.2) $\Gamma_t^4(u) = J'(u)^{4t+6} f[J(u)]\, J(u)^{-t_1}[J(u) - 1]^{-t_2}$,
where, in the notation of “integral part” [· · ·],
(24.3) $t_1 = 4 + [8t/3]$, $t_2 = 2t + 3$,
and the degree of $f$ is $\le t_1 + t_2 - 4t - 9$, by the usual method of Poincaré
[2].
Beyond these considerations $\Gamma_t^4(u)$ must behave like a perfect fourth
power near every finite complex value of $u$:


t    t_1    t_2    max degree f
0     4      3        -2
1     6      5        -2
2     9      7        -1
3    12      9         0


Thus the first instance is, at $t = 3$,
(24.4) $\Gamma_3^4(u) = \text{const.}\ J'(u)^{18} J(u)^{-12}[J(u) - 1]^{-9}$.
The right-hand member indeed is a perfect fourth power. This will not
necessarily imply that $\Gamma_3(u)$ transforms with the factors required in (22.9).
(This requires continuation.) We have nevertheless established that in the
terminology of § 22, $j \ge 3$ for $D = 8$. Thus $\Psi/\Theta^3$ is free of singularities and
finally by Theorem 4,


(24.5) $\Psi(d,c;\tau;3,0) = \Theta^3(d,c;\tau)$,
(24.6) $A_3(\mu) = Z(\mu)$.

We can justify anew the assertion that
(24.7) $\Gamma_3(u) = J'(u)^{9/2}[J(u) - 1]^{-9/4}J(u)^{-3}$


must necessarily satisfy the properties (22.9). For instance, such a $\Gamma_3(u)$
exists because $\Theta$ has only a simple zero and starting with $\Xi = \Psi$ in (22.7)
we must necessarily arrive at a $\Gamma_3 \ne 0$.

Continuing along the lines of the earlier part [16], we turn our attention
to dimensionalities: The modular forms $\Xi(d,c;\tau)$ described by equations
(16.3)-(16.7) with $k = 3$, $s = 0$, are of dimension one if they are bounded at
infinity. To see this, we note that even in the absence of the special informa- 
tion in §23, we can conclude that T;(w) =O(J-/*(u)), from equations 
(22.9). Yet we could conceivably have more forms T;*(w), since the degree 
of f[J(w)] can now be We still insist must be 
single-valued in wu regardless of how it transforms under the modular group. 
We then find these additional forms for Ty‘, T',*, T'3* are the only possibilities : 


J’(u)®/J (u)*[ J(u) —1]8, J’(u)**/J (u)® [J (uw) —1]*, 
(u)? [J (u) —1]*. 


Even though these are fourth powers, their fourth roots are of the form 
(J(u) —1]4P,*(u) (Rat. Funct. of J’, J). Thus since [J (uw) —1]4 acquires 
a sign change when u>w--1, it follows that the only solution to system 
(22.9) is still (24.7). 


25. Failure of the identity for D = 12. Here we consider the
transformation (22.10) for which Hecke’s invariant $I(u)$ is most natural (see
§ 10). It has the properties


$I(u + 3^{1/2}) = I(u)$,
$I(-1/u) = I(u)$,
(25.1) $I(u) = \text{const}\cdot \exp(-2\pi iu/3^{1/2}) + O(1)$, as $u \to i\infty$,
$I(u) = \text{const}\cdot (u - i)^2 + 1 + O(u - i)^3$, as $u \to i$,
| I(w) = const: (u— exp 4+- O(u—exn 
as U—> exp 




We note 


+ 34) = T;*(u), 
(25. 2) (—1/u) (wu) 
= O[I(u)-?] as u> io. 


Then by the usual method 


(25.3) (u) =I’ (u)****f [I (uw) —1]-*s, 
where
(25.4) $t_1 = 5 + [10t/3]$, $t_2 = 2t + 3$


and $f$ is a polynomial of degree $\le t_1 + t_2 - 4t - 8$. We then obtain the values


t    t_1   t_2   max degree f
0    5     3     0
1    8     5     1
2    11    7     2
3    15    9     4
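
The tabulated values follow mechanically from (25.4) and the degree bound. The lines below are a minimal sketch of that arithmetic, assuming the bracket $[x]$ denotes the integer part; the function name `exponent_row` is illustrative and does not come from the paper.

```python
def exponent_row(t):
    """Return (t, t1, t2, max degree of f) from (25.4) and the degree bound."""
    t1 = 5 + (10 * t) // 3            # t1 = 5 + [10t/3], [x] read as integer part
    t2 = 2 * t + 3                    # t2 = 2t + 3
    max_degree = t1 + t2 - 4 * t - 8  # degree f <= t1 + t2 - 4t - 8
    return (t, t1, t2, max_degree)

rows = [exponent_row(t) for t in range(4)]
for row in rows:
    print(*row)
```

Each printed row corresponds to one line of the table for t = 0, 1, 2, 3.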


While there are now plenty of polynomials, more importantly, many are fourth
powers in the $u$-plane. For $t = 0, 1, 2$ we have the following exact fourth


powers: 
(24. 5) 

(24. 6) P,*(u)* =I’ (u) (u) — 1], 
(24. 8) (uw)? [I (wv) —1}*. 


We can verify that T)* (uw) = const + O(u—7) hence comparing behaviors 
at u=1, does not satisfy the required sign condition, i.e., (—1/u) 
Moreover, and ~const(w—z), hence they satisfy 
conditions (22.10), but must be excluded since + 34) = + 
from its order at ic%. Thus the modular forms H(d,c;7r) described by equa- 
tions (16.3)-(16.7) with k=3, s=0 are a vector space of dimension two, 
or three if they are bounded at infinity. 

We know that $\Psi(d,c;\tau;3,0) \neq \Theta^3(d,c;\tau)$ because $Z(1) = 8$ while $A_3(1) = 6$,
so the coefficients of $e(\tau)$ do not match when $d = c = 0$. The third
independent solution of the = system is not apparent, if it exists at all. 


26. Rational results. If we take the coefficient in $\Psi$ for $e(\pi^2\mu\tau)$,
namely $Z(\pi^2\mu)$, there is a very simple relation with $Z(\mu)$ in the case where




$\pi$ ($\nmid 2$) is an odd prime not dividing the square-free totally positive $\mu$. Using
formulae (20.3) and (20.4) we find


Pe N(x N (a 0, 


This is the only factor of $P(\mu)$ in (17.12) that changes, and the value of
$N(\mu)$ of course also changes when $\mu\pi^2$ replaces $\mu$. Thus it is easy to see that
for $D = 8, 12$,


2) — §4(u)| N(x)|, N(x) > 0, 
= +21, Wee) <0. 
In the case where $D = 8$, $Z(\mu) = A_3(\mu)$ and (26.2) gives us a theorem on
the comparative number of representations of $\mu$ and $\pi^2\mu$ by three squares.
Such theorems would not be easy to prove directly.
For example, we might start with $A_3(1) = 6$ and a prime


(26.3) p=8K +1, 

so that in $R(2^{1/2})$

(26. 4) + 

Then using some of the results of (19.4) and (26.2) we learn that the equation,
in $R(2^{1/2})$,

(26.5) m= + + 

has $48K$ non-trivial solutions (the triviality $\pi^2 = \pi^2$ counting for six
additional ones). If


(26.6) $p \equiv \pm 3 \pmod 8$,

so that $p$ is prime in $R(2^{1/2})$, the equation
(26. 7) p? = + + 


has $6(p^2 - 1)$ non-trivial solutions.
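
The smallest inert case, $p = 3$, can be checked by brute force. The sketch below rests on two assumptions that the garbled displays do not confirm: that $R(2^{1/2})$ is the ring $Z[\sqrt 2]$, and that the equation in question expresses the left-hand side as a sum of three squares of elements of that ring. The function name is illustrative only.

```python
from itertools import product
from math import isqrt

def count_three_square_reps(n):
    """Count triples (x1, x2, x3) in Z[sqrt 2]^3 with x1^2 + x2^2 + x3^2 = n.

    Writing x_i = a_i + b_i*sqrt(2), the equation splits into
        sum(a_i^2 + 2*b_i^2) = n   and   sum(a_i*b_i) = 0.
    """
    a_range = range(-isqrt(n), isqrt(n) + 1)
    b_range = range(-isqrt(n // 2), isqrt(n // 2) + 1)
    count = 0
    for a in product(a_range, repeat=3):
        rest = n - sum(x * x for x in a)      # must equal 2 * sum(b_i^2)
        if rest < 0 or rest % 2:
            continue
        for b in product(b_range, repeat=3):
            if 2 * sum(y * y for y in b) == rest and \
               sum(x * y for x, y in zip(a, b)) == 0:
                count += 1
    return count

print(count_three_square_reps(1))   # representations of 1
print(count_three_square_reps(9))   # p = 3, the six trivial solutions included
```

Under these assumptions the count for 1 agrees with $A_3(1) = 6$, and removing the six trivial solutions from the count for 9 leaves $6(3^2 - 1) = 48$.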

It would be irrelevant to our goal to attempt to categorize the ensuing 
wealth of non-analytic results. 

In concluding, we might note that starting with hyperelliptic integrals 
and using Hermite’s analysis, Humbert [19] discovered that D = 5,8, and 12 
all enjoy a favored role in theta-function theory, but he did not press the 
matter far enough to discover any number theoretic results of intrinsic interest. 


UNIVERSITY OF ARIZONA. 


HARVEY COHN. 


REFERENCES. 


[16] H. Cohn, "Decomposition into four integral squares in the fields of $2^{1/2}$ and $3^{1/2}$,"
American Journal of Mathematics, vol. 82 (1960), pp. 301-321.

[16a] H. Hasse, "Allgemeine Theorie der Gaussschen Summen in algebraischen Zahlkörpern,"
Abhandlungen der deutschen Akademie der Wissenschaften zu
Berlin, Math.-Nat. Klasse (1951), no. 1.

[17] E. Hecke, "Bestimmung der Klassenzahl einer neuen Reihe von algebraischen
Zahlkörpern," Nachrichten der Königlichen Gesellschaft der Wissenschaften
zu Göttingen, Mathematisch-physikalische Klasse (1921), pp. 1-23.

[18] E. Hecke, Vorlesungen über die Theorie der algebraischen Zahlen, Leipzig (1923),
chapter 8.

[19] G. Humbert, "Sur les fonctions abéliennes singulières d'invariants huit, douze, et
cinq," Journal de Mathématiques Pures et Appliquées, ser. VI, vol. 2
(1906), pp. 329-355.

[20] E. Landau, Vorlesungen über Zahlentheorie, Leipzig (1927), pp. 167 ff.

[21] H. Maass, "Konstruktion ganzer Modulformen halbzahliger Dimension mit ϑ-Multiplikatoren
in einer und zwei Variabeln," Abhandlungen aus dem
Mathematischen Seminar der Hansischen Universität, vol. 12 (1938), pp.
133-162.

[22] H. Maass, "Konstruktion ganzer Modulformen halbzahliger Dimension mit ϑ-Multiplikatoren
in zwei Variabeln," Mathematische Zeitschrift, vol. 43 (1938),
pp. 709-738.

[23] L. J. Mordell, "Note on class relation formulae," Messenger of Mathematics, vol.
45 (1916), pp. 76-80.


MULTIPLE POINTS OF A MAP AND THE REDUCED
CYCLIC PRODUCT.*


By ANDRÉ HAEFLIGER.


The main purpose of this note is to determine the universal cohomology class
modulo $p$ dual to the cycle of $p$-fold points of a map $f$
of a manifold $V$ into a manifold $M$, where $p$ is prime and the $p$-fold points
are regarded as points of the cyclic product of $V$. This class
can also be interpreted as an obstruction to finding, in the homotopy class
of $f$, a map without $p$-fold points. It is closely related to
Wu's embedding class $\Phi_p$ (cf. [13]).

The method used gives an explicit determination of the cohomology
modulo $p$ of the reduced $p$-fold cyclic product $V^*_p$ of a manifold $V$. It is in the
cohomology of this space that obstructions to embedding
$V$ in a manifold $M$ are found (cf. [13] and [7]). We recover the conditions
given by Wu for the vanishing of the classes $\Phi_p$ when $M$ is
Euclidean space.

I wish to thank Prof. N. E. Steenrod warmly for communicating
his results to me before their publication.


1. Definitions. This section essentially recalls the notions
introduced in [5].


1.1. $p$-fold point of type $\pi$ of a map. Let $f$ be a continuous map
of a space $V$ into a space $M$. A $p$-fold point of $f$ is a
sequence $(x_1, \dots, x_p)$ of $p$ distinct points of $V$ such that $f(x_1) = \cdots = f(x_p)$.
It is natural, however, to identify two such sequences if one is obtained from the other
by a permutation. So let $\pi$ be a group of permutations of $p$ objects;
it acts on the product $V^p$ by permuting the factors and, by restriction, on
the subspace $V^p_0$ formed by the sequences of $p$ distinct points of $V$. Let $V^0_\pi$ be the
quotient of $V^p_0$ by the action of $\pi$: two points $(x_1, \dots, x_p)$ and $(x'_1, \dots, x'_p)$
of $V^p_0$ define the same point $\{x_1, \dots, x_p\}$ of $V^0_\pi$ if one of the sequences is
obtained from the other by a permutation belonging to $\pi$. By definition,
a $p$-fold point of $f$ of type $\pi$ is a point $\{x_1, \dots, x_p\}$ of $V^0_\pi$ such that
$f(x_1) = \cdots = f(x_p)$.
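
For a map between finite sets the definition can be carried out by direct enumeration. The following is only an illustrative sketch: the toy map `f`, the function name `p_fold_points`, and the encoding of permutations as index tuples are assumptions made for the example, not notation from the paper.

```python
from itertools import permutations

def p_fold_points(f, p, group):
    """Classes of p-fold points of a finite map f (dict: point -> image).

    `group` is a list of permutations of range(p), each given as a tuple g,
    acting on a sequence s by s -> (s[g[0]], ..., s[g[p-1]]).
    """
    classes = set()
    for seq in permutations(f, p):               # sequences of p distinct points
        if len({f[x] for x in seq}) == 1:        # f(x_1) = ... = f(x_p)
            orbit = frozenset(tuple(seq[i] for i in g) for g in group)
            classes.add(orbit)                   # one class per orbit of the group
    return classes

f = {"a": 0, "b": 0, "c": 0, "d": 1}             # toy map, purely illustrative
cyclic = [(0, 1, 2), (1, 2, 0), (2, 0, 1)]       # cyclic permutations of 3 objects
symmetric = list(permutations(range(3)))         # all permutations of 3 objects
n_cyclic = len(p_fold_points(f, 3, cyclic))
n_symmetric = len(p_fold_points(f, 3, symmetric))
print(n_cyclic, n_symmetric)
```

The three points with common image give six ordered sequences, which fall into two classes under the cyclic group and a single class under the full symmetric group, so the type $\pi$ genuinely affects the count.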


* Received June 8, 1960. 




1.2. Universal class of $p$-fold points. Suppose that $M$ is a
manifold of dimension $m$. Let $E_\pi \to B_\pi$ be a universal fibre space with structural
group $\pi$. Let $M^p_\pi$ be the bundle $E_\pi \times_\pi M^p$ associated with $E_\pi$, with fibre $M^p$ on which
$\pi$ acts by permuting the factors. The union of the diagonals of the fibres
forms a trivial subbundle $M_\pi = B_\pi \times M$: the diagonal subbundle.

The class $\Delta^M_\pi$ dual to $M_\pi$ in $M^p_\pi$ can be defined as follows.
Let $E'_\pi \to B'_\pi$ be a bundle with group $\pi$, universal up to a large dimension $N$,
and such that $E'_\pi$ and $B'_\pi$ are manifolds; then the bundle $M'^p_\pi$ associated with $E'_\pi$ with
fibre $M^p$ is a manifold, as is the diagonal subbundle $M'_\pi$. Let $\Delta'^M_\pi$ be the
class dual to the homology class of $M'^p_\pi$ represented by the submanifold $M'_\pi$;
it is an element of $H^{(p-1)m}(M'^p_\pi)$, the coefficients being the integers modulo 2
if $M$ is non-orientable or a sheaf of twisted integers if $M$ is oriented.
There exists a map, defined up to homotopy, of $M'^p_\pi$ into $M^p_\pi$; it
induces an isomorphism of $H^{(p-1)m}(M^p_\pi)$ onto $H^{(p-1)m}(M'^p_\pi)$ if $N$ is large
enough. By definition $\Delta^M_\pi$ is the element of $H^{(p-1)m}(M^p_\pi)$ corresponding to
$\Delta'^M_\pi$ under this isomorphism.

The class $\Delta^M_\pi$ (or more simply $\Delta_\pi$) will be the universal class of
$p$-fold points of type $\pi$ for continuous maps into $M$. This terminology
is justified in the sections that follow.


1.3. The class $O^f_\pi$. The space $V^p_0$ (cf. 1.1) is a covering of $V^0_\pi$
and may be regarded as a principal bundle with structural group $\pi$.
As above, let $E$ be the bundle associated with $V^p_0$ with fibre $M^p$, and let $D$ be the
diagonal subbundle formed by the union of the diagonals of the fibres. Every
continuous map $f$ of $V$ into $M$ defines a section $f^0_\pi$ of $E$, and the point
$z = \{x_1, \dots, x_p\}$ of $V^0_\pi$ is a $p$-fold point of $f$ if and only if $f^0_\pi(z)$
belongs to $D$.

If $V$ is a reasonably nice space (for example a polyhedron), there exists
a map $h$ of $E$ into the universal bundle $M^p_\pi$, unique up to homotopy.


By definition the class $O^f_\pi \in H^{(p-1)m}(V^0_\pi)$ is equal to $f^{0*}_\pi h^*(\Delta^M_\pi)$. It is clear that $O^f_\pi$
depends only on the homotopy class of $f$.


PROPOSITION. If $f$ is homotopic to a map of $V$ into $M$ without
$p$-fold points, then $O^f_\pi = 0$.


Indeed, if $f$ is a map without $p$-fold points, then $hf^0_\pi$ maps $V^0_\pi$
into $M^p_\pi - M_\pi$; on the other hand the image of $\Delta^M_\pi$ under the homomorphism induced
by the injection of $M^p_\pi - M_\pi$ into $M^p_\pi$ is zero, since $\Delta^M_\pi$ is represented by
a cochain with support in $M_\pi$.
The class $O^f_\pi$ represents, in a certain sense, a first obstruction
to finding in the homotopy class of $f$ a map without $p$-fold points
(cf. [7], [13]).


1.4. Interpretation of the class $O^f_\pi$ in the differentiable case. Let $f$ be
a differentiable map of a manifold $V$ into a manifold $M$ presenting
its $p$-fold points in regular form, that is to say that for every $p$-fold point
$\{x_1, \dots, x_p\}$ of $f$ the images under $f$ of the tangent spaces to $V$ at $x_1, \dots, x_p$ are
in general position in the tangent space to $M$ at $y = f(x_1) = \cdots = f(x_p)$.
This amounts to saying that $f^p: V^p \to M^p$ is transverse (cf. [11]) to the diagonal
of $M^p$ at every point of $V^p_0 \subset V^p$, or again that the section $f^0_\pi$ of the bundle $E$ is
transverse to the diagonal subbundle $D$. Under these conditions, the $p$-fold points
of type $\pi$ of $f$ form a submanifold of $V^0_\pi$ representing a homology
class dual to $O^f_\pi$. This follows immediately from the definition of $O^f_\pi$,
since $h^*(\Delta^M_\pi)$ is the class dual to $D$ in $E$.

One may remark (cf. [5], I, 3) that a given map can always be approximated
by a differentiable map which presents its $p$-fold points in regular
form on an open subset of $V^0_\pi$ which is a deformation retract of $V^0_\pi$.


1.5. Reduced cyclic product $V^*_p$ and embedding classes $\Phi^f_p$. Let
$p$ be a prime number. Following the terminology of Wu [13], we call the reduced
$p$-fold cyclic product of $V$ the quotient $V^*_p$ of $V^p - V$ (the $p$-fold product of $V$
with its diagonal $V$ removed) by the action of the group $\pi$ of cyclic permutations
of the factors. Since $p$ is prime, the space $V^p - V$ is a $p$-sheeted covering
of $V^*_p$ and can also be regarded as a principal fibre space
with structural group $\pi$. One can therefore construct as above (1.2 and
1.3) the associated bundle $E$ (with base $V^*_p$) with fibre $M^p$, and one has a map $h$
of $E$ into the universal bundle $M^p_\pi$; every continuous map $f$ of $V$ into $M$ defines
a section $f_\pi$ of $E$.


By definition the class $\Phi^f_p \in H^{(p-1)m}(V^*_p)$ is equal to $f^*_\pi h^*(\Delta^M_\pi)$.


Note that $O^f_\pi$ is the image of $\Phi^f_p$ under the homomorphism induced
by the injection of the subspace $V^0_\pi$ into $V^*_p$. Naturally, for $p = 2$,
$O^f_\pi = \Phi^f_2$.


PROPOSITION. If $f$ is homotopic to an embedding of $V$ in $M$, then
$\Phi^f_p = 0$.


The proof is the same as that of the preceding proposition.

The classes $\Phi^f_p$ generalize, up to sign, the embedding classes
$\Phi^m_p$ defined by Wu when $M$ is the Euclidean space $R^m$ (cf. [13], II).
Following Wu (cf. [13], II), one could also define the immersion
classes $\Psi^f_p$ by taking the image of $\Phi^f_p$ in the direct limit of the cohomologies
of the neighbourhoods of the diagonal in the reduced $p$-fold cyclic product of $V$. If $f$ is
homotopic to an immersion (i.e. a locally one-to-one map),
then $\Psi^f_p = 0$.


2. Steenrod's results. In all that follows, $p$ is a prime
number and $\pi$ is the group of cyclic permutations of $p$ objects; $T$ is a
generator of $\pi$. All cohomology groups considered have the integers
modulo $p$ as coefficients.


2.1. The cohomology ring of the classifying space $B_\pi$ for the group $\pi$ will be
denoted $H^*(\pi)$ (cf. [3], Chap. XII).


For $p = 2$, $H^*(\pi) = P(\mu)$, where $P(\mu)$ is the ring of polynomials over
$Z_2$ in the variable $\mu \in H^1(\pi)$.

For $p > 2$, $H^*(\pi) = P(\mu) \otimes E(\nu)$, where $P(\mu)$ is the ring of polynomials
over $Z_p$ in the variable $\mu \in H^2(\pi)$ and $E(\nu)$ is the exterior algebra generated by
an element $\nu \in H^1(\pi)$.

The cohomology of every fibre space $g: E \to B$ with structural group $\pi$ is
an $H^*(\pi)$-algebra: let $h$ be the map (defined up to homotopy) of
$B$ into $B_\pi$ which defines the class of $E$; if $x \in H^*(E)$ and $\alpha \in H^*(\pi)$, by definition
$\alpha x = g^*h^*(\alpha) \cup x$.


2.2. Computation of $H^*(M^p_\pi)$. Let $M$ be a finite complex and let $M^p_\pi$ be the bundle
associated with $E_\pi$ with fibre $M^p$, on which $\pi$ acts by cyclic permutations of the
factors (cf. 1.2).


THEOREM (Steenrod). There exists a natural isomorphism of $H^*(\pi)$-algebras
$H^*(M^p_\pi) \to H^*(\pi, H^*(M)^p)$.
The natural projection of $H^*(\pi, H^*(M)^p) = \sum_{r \ge 0} H^r(\pi, H^*(M)^p)$ onto
$H^0(\pi, H^*(M)^p)$, identified with the subgroup of invariant elements of $H^*(M)^p = H^*(M^p)$,
corresponds to the homomorphism $r^*: H^*(M^p_\pi) \to H^*(M^p)$ induced
by the injection of the fibre $M^p$ into $M^p_\pi$.


In this statement, $H^*(M)^p$ is regarded as a $\pi$-module (i.e. a
module over the group ring $Z(\pi)$ of $\pi$), $\pi$ acting by cyclic permutation of the
factors of the tensor product $H^*(M)^p$ with the usual change of sign
(cf. [3]).


2.3. Before sketching the proof, let us make explicit the structure of
$H^*(\pi, H^*(M)^p)$.


Let $D$ be the subgroup of $H^*(M)^p$ formed by the linear combinations of the
diagonal elements $(x)^p = x \otimes \cdots \otimes x$, where $x \in H^*(M)$; the group $\pi$
leaves the elements of $D$ fixed. As a $\pi$-module, $H^*(M)^p$ is the direct sum
of $D$ and a free submodule. Indeed, let $(a_i)$ be a basis of $H^*(M)$;
the elements $a_{i_1} \otimes \cdots \otimes a_{i_p}$ form a basis of $H^*(M)^p$. The elements
$a_i \otimes \cdots \otimes a_i$ form a basis of $D$ and the other elements of the basis generate
a $\pi$-free submodule which can be written in the form $Z_p(\pi) \otimes S$, where
$S$ is the vector space over $Z_p$ generated by a $\pi$-basis.
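
The splitting of the basis into diagonal elements and free orbits can be checked by enumeration. The sketch below treats only the index tuples $(i_1, \dots, i_p)$ and ignores the sign convention, which does not affect the orbit structure; the function name `split_basis` is an assumption made for the example.

```python
from itertools import product

def split_basis(n, p):
    """Split the index tuples (i_1, ..., i_p), 0 <= i_k < n, into diagonal
    tuples and orbits of the cyclic shift of length greater than one."""
    seen, diagonal, free_orbits = set(), [], []
    for idx in product(range(n), repeat=p):
        if idx in seen:
            continue
        orbit = {idx[k:] + idx[:k] for k in range(p)}   # all cyclic shifts
        seen |= orbit
        if len(orbit) == 1:
            diagonal.append(idx)                        # a_i x ... x a_i
        else:
            free_orbits.append(sorted(orbit))
    return diagonal, free_orbits

diagonal, free_orbits = split_basis(3, 3)
print(len(diagonal), len(free_orbits))
```

For $n = 3$ and $p = 3$ this gives 3 diagonal elements and $(3^3 - 3)/3 = 8$ orbits, each of length exactly $p$ because $p$ is prime; one representative per orbit plays the role of the $\pi$-basis spanning $S$.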

Thus $H^*(M)^p = D + Z_p(\pi) \otimes S$ and therefore


$H^*(\pi, H^*(M)^p) = H^*(\pi) \otimes D + N$,


where $N$ denotes the image of $H^*(M)^p$ under the norm operator $N: 1 + T + \cdots + T^{p-1}$
(cf. [3], Chap. XII).

The $H^*(\pi)$-module $H^*(\pi, H^*(M)^p)$ is therefore generated by the subgroup
$D + N$ of elements of $\pi$-degree 0 (an element of $H^r(\pi, H^*(M)^p)$ is of
$\pi$-degree $r$); moreover $H^*(\pi)$ acts trivially on the norms $N$ (cf. [3],
Chap. XII).


The multiplicative structure is determined by remarking that the elements
of $\pi$-degree 0 form a subring isomorphic to that of the invariant elements
of $H^*(M)^p$ and that $H^*(\pi) \otimes D$ is a subalgebra, the external product of the
rings $H^*(\pi)$ and $D$. Finally, note that the map of $H^*(M)$
onto $D$ sending an element $x$ to its $p$-th tensor power
$(x)^p$ is a ring homomorphism modulo $N$.


2.4. The proof of the theorem is roughly as follows (for more
details, see [9]).

Let $W$ be the chain complex of $E_\pi$ (a free acyclic resolution of the
$\pi$-module $Z$) and let $C_*(M)$ (resp. $C^*(M)$) be the chain complex (resp.
cochain complex) of $M$ with coefficients in $Z_p$. Then $H^*(M^p_\pi)$ is the cohomology of the
double complex $\mathrm{Hom}(W \otimes_\pi C_*(M)^p, Z_p)$, which is canonically identified with the
double complex $\mathrm{Hom}_\pi(W, C^*(M)^p)$. Since the coefficients are in a
field, there exists a natural equivalence (up to homotopy) between the
chain complex $C^*(M)$ and $H^*(M)$ regarded as a chain complex
with zero boundary operator; it also follows that the $\pi$-chain complexes
$C^*(M)^p$ and $H^*(M)^p$ are equivalent. Hence $\mathrm{Hom}_\pi(W, C^*(M)^p)$ is
equivalent (as a $\pi$-complex) to the simple complex $\mathrm{Hom}_\pi(W, H^*(M)^p)$,
whose cohomology is by definition $H^*(\pi, H^*(M)^p)$. There exists therefore a
natural isomorphism $\phi: H^*(M^p_\pi) \to H^*(\pi, H^*(M)^p)$ of $H^*(\pi)$-modules.
The second part of the statement follows immediately from the definition
of $\phi$. This shows that the restriction of $\phi$ to the elements of $\pi$-degree 0 is a
multiplicative homomorphism; since $H^*(\pi, H^*(M)^p)$ is generated as an $H^*(\pi)$-module
by its elements of $\pi$-degree 0 (cf. 2.3), it follows that $\phi$ is also
a ring isomorphism.


2.5. The homomorphism $i^*: H^*(M^p_\pi) \to H^*(M_\pi)$. Let $M_\pi$ be the diagonal
subbundle of $M^p_\pi$ (cf. 1.2); it is isomorphic to the product $B_\pi \times M$.

With the identification of Theorem 2.2 and the identification of $H^*(M_\pi)$ with
$H^*(\pi) \otimes H^*(M)$, the homomorphism $i^*$ induced by the injection of $M_\pi$ into
$M^p_\pi$ will be a homomorphism of $H^*(\pi)$-algebras:


$i^*: H^*(\pi, H^*(M)^p) \to H^*(\pi) \otimes H^*(M)$.


It will suffice to know $i^*$ on the subgroup $D + N = H^0(\pi, H^*(M)^p)$
which generates $H^*(\pi, H^*(M)^p)$. Since $H^*(\pi)$ acts trivially on $N$ and
$H^*(\pi) \otimes H^*(M)$ is a free $H^*(\pi)$-module, we have $i^*(N) = 0$. The value
of $i^*$ on $D$ is given by the


THEOREM (Steenrod). Let $x$ be an element of $H^q(M)$ and $(x)^p$ its $p$-th tensor power. Then, for $p = 2$,


$i^*(x)^2 = \sum_{j=0}^{q} \mu^{q-j}\, Sq^j x$,

[9/2] 
p> 2, i* (7)? = (— 1) D ty — > (— 1) Piz, 
4=0 4>0 


where the $P^j$ (resp. the $Sq^j$) are Steenrod's cyclic reduced $p$-th powers (resp.
squares) (cf. [8]); $\beta$ is the Bockstein homomorphism $H^*(M, Z_p)
\to H^{*+1}(M, Z_p)$; $h = (p-1)/2$ and $\sigma = (-1)^{q/2}$ or $(-1)^{(q-1)/2} h!$
according as $q$ is even or odd.

For the proof, cf. [9]. Note that $i^*$ is injective on
the submodule $H^*(\pi) \otimes D$.


3. Determination of the universal class of $p$-fold points.


3.1. Let $M$ be a connected manifold of dimension $m$, compact, with a
boundary $B$. We shall suppose $M$ oriented if $p > 2$. For every prime number
$p$ and every positive integer $j$, Wu has defined (cf. [12]) the classes $U^j(p)$ belonging
to $H^{2j(p-1)}(M, Z_p)$ for $p > 2$ or to $H^j(M, Z_2)$ for $p = 2$ by the relations:

$\langle Sq^j a, M \rangle = \langle U^j(2) \cup a, M \rangle$, $p = 2$,
$\langle P^j a, M \rangle = \langle U^j(p) \cup a, M \rangle$, $p > 2$,


for every element $a$ belonging to $H^*(M \bmod B, Z_p)$; the symbol $\langle y, M \rangle$
denotes the value (Kronecker index) of $y \in H^r(M, Z_p)$ on the generator
of $H_m(M \bmod B, Z_p)$ defined by the orientation of $M$. Note that $\langle y, M \rangle
= 0$ if $r \neq m$.

To remain within the framework of the preceding theory, we shall suppose that
$M$ is a complex.


3.2. THEOREM. The class $\Delta_\pi$ modulo $p$ dual to the diagonal subbundle $M_\pi$
of $M^p_\pi$ is


$\Delta_\pi = \sum_{0 \le j \le m/2} \mu^{m-2j}\,(U^j(2))^2 + \delta_2$, $p = 2$,
Smi2p 
Ar=A (U4) )? + 8p 
j=0 


where $\delta_p$ is the class dual in $M^p$ to the homology class represented by the
diagonal, $h = (p-1)/2$ and $\lambda = (-1)^{m/2}$ or $(-1)^{(m-1)/2} h!$ according as $m$
is even or odd.


To express $\Delta_\pi$, we have identified $H^*(M^p_\pi)$ with
$H^*(\pi, H^*(M)^p) = \sum_{k>0} H^k(\pi) \otimes D + (D + N)$
(cf. 2.3).


3.3. PROOF. Take for $E_\pi$ the standard complex which has $p$
cells in each dimension $i \ge 0$: $e_i, Te_i, \dots, T^{p-1}e_i$, the boundary operator
being defined by $\partial e_{2n} = (1 + T + \cdots + T^{p-1})e_{2n-1}$ and $\partial e_{2n+1} = (1 - T)e_{2n}$.

Replace the bundle $E_\pi \to B_\pi$ by the bundle $E'_\pi \to B'_\pi$ obtained by removing
from $E_\pi$ all the cells of dimension greater than $(p-1)N - 1$, where
$N > m$. We thus obtain a cellular decomposition of the sphere
$S^{(p-1)N-1}$, and $B'_\pi$ is a lens space of the same dimension. Let $M'^p_\pi$ be
the restriction of the bundle $M^p_\pi$ to $B'_\pi$.

The same reasoning as in 2.4 shows that there exists a natural isomorphism
of $H^*(\pi)$-modules $\phi'$ such that the following diagram is commutative:


$H^*(M'^p_\pi) \xrightarrow{\ \phi'\ } H^*(B'_\pi, H^*(M)^p)$
$\uparrow j^* \qquad\qquad\qquad \uparrow j^*$
$H^*(M^p_\pi) \xrightarrow{\ \phi\ } H^*(B_\pi, H^*(M)^p)$

where the vertical homomorphisms $j^*$ are induced by the inclusion of $M'^p_\pi$
in $M^p_\pi$ and of $B'_\pi$ in $B_\pi$ respectively; $j^*$ is bijective for $r < (p-1)N - 1$, injective
for $r = (p-1)N - 1$ and zero for $r > (p-1)N - 1$, $r$ denoting the $\pi$-degree
(cf. 2.3). Moreover $\phi'$ is also multiplicative (at least on the image of
$j^*$, since $\phi$ and $j^*$ are ring homomorphisms).

Choisissons un générateur S de H (p-1)y-1(B’x, Zp) tel que S>=—1 
si p=2 et S>—1 si p> 2; soit ME Hn(MmodB,Z,) le géné- 
rateur défini par l’orientation de M. Alors les éléments S®M et S@M? 
sont des classes @’homologie qui définissent des orientations modulo p de M’, 
et respectivement. 

Soit Az la classe duale 4 M’s relativement a ces orientations. Pour tout 
xz € ), on a 


3. 4. <i*(z), 


car par définition ArN (S@M?) =i,(S@M) et Ar, S@OMP>= 
<2,4rN (cf. [10]). 

La formule 3.4 permet de calculer Ar modulo les normes WN en appli- 
quant mécaniquement 2.3 et 2.5. Montrons-le dans le cas p—2. Posons 
Ag = > modulo N, ot Vie Hi(M). Soit a€ H™*(MmodB) et 
soit D’aprés 2.5, on a i* (x) => = qia 
pour raison de dimensions. De méme, d’aprés 2.3, 


tUAc=> pN+2i-2j-1 (Via)? (Via) 
Done 


<1*(z), SOM >= < Sqiz,M> et Ar, SOM>—< Vie, M> 
Mot Vi=U iis). 
Un calcul analogue donne dans le cas p> 2 


DY (Vip) )? modulo N. 


05j=m/2p 
To determine $\Delta_\pi$, it suffices to know its component of $\pi$-degree 0 (cf.
2.3). For this, note that the injection $r$ of the fibre $M^p$ into $M'^p_\pi$ is
transverse to the submanifold $M'_\pi$ and that $r^{-1}(M'_\pi)$ is the diagonal of $M^p$.
Hence $r^*(\Delta_\pi)$ is the class $\delta_p$ dual to the diagonal in $M^p$ (cf. [11]); since
$r^*$ is injective on the elements of $\pi$-degree 0 and identifies them with the
invariant elements of $H^*(M^p)$ (cf. 2.2 and 2.3), the theorem is proved.


3.5. Wu's characteristic classes. Set


$W^i = \sum_j Sq^{i-j}\, U^j(2)$


and


$Q^k(p) = \sum_j P^{k-j}\, U^j(p)$,


where $M$ is a manifold.


The classes $W^i$ are the Stiefel-Whitney classes of $M$ and the classes
$Q^k(p) \in H^{2k(p-1)}(M, Z_p)$ (also denoted $Q^k$) are the characteristic classes of
Wu. If $M$ is a differentiable manifold, these classes are polynomials in
the Pontryagin classes of $M$ reduced modulo $p$ (cf. [6]).
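
As a concrete check of the relation between the Wu classes and the Stiefel-Whitney classes for $p = 2$, one can take real projective space $RP^n$, which is not discussed in the text and is used here only as an illustration. The sketch relies on standard facts assumed from outside the paper: $H^*(RP^n; Z_2) = Z_2[a]/(a^{n+1})$, $Sq^j a^k = \binom{k}{j} a^{k+j}$, and total Stiefel-Whitney class $(1+a)^{n+1}$. All function names are illustrative.

```python
from math import comb

def wu_classes_rp(n):
    """Mod 2 coefficients c_j with U^j = c_j * a^j on RP^n.

    The defining relation <Sq^j x, M> = <U^j x, M> for x = a^(n-j)
    gives c_j = C(n-j, j) mod 2.
    """
    return [comb(n - j, j) % 2 for j in range(n + 1)]

def stiefel_whitney_from_wu(n):
    """Coefficients of W^i = sum_j Sq^(i-j) U^j in H^*(RP^n; Z_2)."""
    u = wu_classes_rp(n)
    return [sum(u[j] * comb(j, i - j) for j in range(i + 1)) % 2
            for i in range(n + 1)]

def stiefel_whitney_rp(n):
    """Coefficients of (1 + a)^(n+1) truncated at a^n."""
    return [comb(n + 1, i) % 2 for i in range(n + 1)]

print(stiefel_whitney_from_wu(4), stiefel_whitney_rp(4))
```

For $n = 4$ both lists describe the class $1 + a + a^4$, so the two computations agree in this example.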

Consider the class $i^*(\Delta_\pi)$, the restriction of $\Delta_\pi$ to $M_\pi$; it can be
interpreted as the class dual to the self-intersection of the "submanifold" $M_\pi$
of $M^p_\pi$; in the differentiable case, $i^*(\Delta_\pi)$ is the Euler class of the normal bundle
of $M_\pi$ in $M^p_\pi$ (that is to say, the union of the normal bundles to the diagonals of the
fibres, or again the limit of the normal bundle of $M'_\pi$ in $M'^p_\pi$ as $N$ tends
to infinity).


3.6. COROLLARY.
$i^*(\Delta_\pi) = \sum_i \mu^{m-i}\, W^i$, $p = 2$,
(Ar) =AD (—1) —AD (—1) ip), > 2. 
k k 


These expressions are computed immediately by applying 2.5 and 3.2.
When $M$ is a differentiable manifold, then $\beta Q^k(p) = 0$, since the $Q^k$ are
the reduction modulo $p$ of integral classes (polynomials in the
Pontryagin classes) (cf. [11], p. 63).

The classes $i^*(\Delta_\pi)$ can also be interpreted as universal immersion
classes in $M$ (cf. 1.5, 5.4).


3.7. Expression of $\Phi^f_p$. Note first that $V^p_\pi - V_\pi$ has the
same homotopy type as the reduced $p$-fold cyclic product $V^*_p$ of $V$. Indeed
$V^p_\pi - V_\pi = E_\pi \times_\pi (V^p - V)$ is fibred over $V^*_p$ with contractible fibre $E_\pi$.
The projection $g$ of this fibration induces a natural isomorphism of $H^*(V^*_p)$
onto $H^*(V^p_\pi - V_\pi)$; with this identification, $j^*: H^*(V^p_\pi) \to H^*(V^*_p)$
will denote the homomorphism induced by the injection $j$ of $V^p_\pi - V_\pi$ into $V^p_\pi$.

Every continuous map $f$ of $V$ into $M$ induces a continuous map
$\phi^p_\pi$ of $V^p_\pi$ into $M^p_\pi$. With the
identifications of 2.2, $\phi^{p*}_\pi$ is none other than the homomorphism of $H^*(\pi, H^*(M)^p)$
into $H^*(\pi, H^*(V)^p)$ induced by the homomorphism $f^{*p}: H^*(M)^p \to H^*(V)^p$.

One verifies easily (notations of 1.5) that $hf_\pi g$ is homotopic to $\phi^p_\pi j$.
Hence $\Phi^f_p = j^*[\phi^{p*}_\pi(\Delta^M_\pi)]$.

Pour p= 2, on aura par exemple 


L’homomorphisme j* sera explicité au paragraphe suivant dans le cas 




4. Cohomology modulo $p$ of the reduced $p$-fold cyclic product of a manifold.


4.1. Exact sequences associated with a submanifold. Let $V$ be a closed
submanifold of codimension $q$ of a paracompact manifold $M$. In this section
homology and cohomology have integer coefficients or coefficients modulo 2 according
as the manifolds are oriented or not, and the family of supports is that of
all closed sets. Using the cohomology defined by A. Borel in [1]
(cf. also [2]), $U$ being an open neighbourhood of $V$, one has the following exact sequence:


$\to H_r(V) \to H_r(U) \to H_r(U - V) \to H_{r-1}(V) \to \cdots$


where the first homomorphism is induced by the injection $i$ of $V$ into $U$ and the
second by the restriction to $U - V$ of the chains of $U$. If $U'$ is an open
neighbourhood of $V$ contained in $U$, this exact sequence maps by restriction into
the analogous exact sequence for $U'$. Setting on the one hand $U = M$ and passing
on the other hand to the direct limit over the directed set of neighbourhoods of $V$,
one has the following commutative and exact diagram:


$\to H_r(V) \to H_r(M) \to H_r(M - V) \to$


$\to H_r(V) \to \mathrm{lim.\,dir.}\, H_r(U) \to \mathrm{lim.\,dir.}\, H_r(U - V) \to$


Let us pass to cohomology by Poincaré duality (cf. [2]). The injection
of $V$ into $U$ induces an isomorphism of $\mathrm{lim.\,dir.}\, H^r(U)$ onto $H^r(V)$ (cf. [4]).
With this identification, and denoting by $H^*(M \backslash V)$ the direct limit of $H^r(U - V)$
as $U$ runs through the open neighbourhoods of $V$ in $M$, one has


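4.2. PROPOSITION. The following diagram is commutative.


$\cdots \to H^{r-q}(V) \xrightarrow{\ \phi\ } H^r(M) \xrightarrow{\ j^*\ } H^r(M - V) \to \cdots$
(vertical maps: the identity, $i^*$, and restriction)
$\cdots \to H^{r-q}(V) \xrightarrow{\ \phi_0\ } H^r(V) \to H^r(M \backslash V) \to \cdots$


The horizontal lines are exact, $\phi$ is the Gysin homomorphism determined
by the injection $i$ of $V$ into $M$, $j^*$ is induced by the injection of $M - V$
into $M$. The homomorphism $\phi_0$ is determined by the cup product with the class
$i^*(V^*)$, where $V^*$ is the class dual to $V$ in $M$.


It remains to verify this last point. Since $\mathrm{lim.\,dir.}\, H^*(U) = H^*(V)$,
for every $\alpha \in H^*(V)$ there exists an open neighbourhood $U$ of $V$ and an element
$\beta \in H^*(U)$ such that $i^*_U(\beta) = \alpha$, where $i_U$ is the injection of $V$ into $U$; if $\phi_U$
denotes the Gysin homomorphism determined by $i_U$, one has $\phi_0(\alpha) = i^*_U\phi_U(\alpha)
= i^*_U\phi_U(i^*_U\beta) = i^*_U(\beta \cup V^*) = \alpha \cup i^*(V^*)$.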

Note that if $V$ is a differentiable submanifold of $M$, then
$H^*(M \backslash V)$ is identified with the cohomology of the boundary (a sphere bundle)
of a tubular neighbourhood of $V$, and the second exact sequence is none other than the
Gysin exact sequence.

The above diagram can be completed by vertical exact sequences
by bringing in the group $H^*(M \bmod V)$.


4.3. Application to the "submanifold" $M_\pi$ of $M^p_\pi$. As in 3.3,
realize $E_\pi$ (resp. $B_\pi$) by a sphere (resp. a lens space) of large
dimension $(p-1)N - 1$. Write the exact sequences of 4.2 associated with the
submanifold $M'_\pi$ of $M'^p_\pi$ and pass to the limit by letting $N$ tend to infinity.
We obtain the exact sequences:

(M?,) ——> H* (M?s— Mz) 
\ \ 
Hr-(0-1)m (Mz) | i* | Hr-@-1)m+1 


We saw in 3.7 that $M^p_\pi - M_\pi$ has the same homotopy type as
the reduced $p$-fold cyclic product $M^*_p$ of $M$. The same reasoning shows that
$H^*(M^p_\pi \backslash M_\pi)$ is canonically isomorphic to $H^*((M^p/\pi) \backslash M)$, where $M^p/\pi$ is
the $p$-fold cyclic product of $M$ (quotient of $M^p$ by the action of $\pi$) and where $M$ is
identified with the diagonal of $M^p/\pi$.

On the other hand $\phi_0$ is given by the cup product with the class $i^*(\Delta_\pi)$. Now the
term of maximum $\pi$-degree in $i^*(\Delta_\pi)$ is $\mu^m$ (cf. 3.6); it follows that
if $\alpha \in H^*(M_\pi)$ is $\neq 0$, then $\alpha \cup i^*(\Delta_\pi)$ is $\neq 0$, hence that $\phi_0$ is injective.
By the commutativity of the diagram, $\phi$ is also injective.

Denote by $r^*$ (resp. $r^*_0$) the homomorphisms induced on cohomology
by the injection of the fibre $M^p$ (resp. $M$) into $M^p_\pi$ (resp. $M_\pi$), and by $\psi$
the Gysin homomorphism determined by the injection of the diagonal $M$
into $M^p$.


4.4. Taking the preceding into account, one has the following diagram, which is commutative
and exact horizontally:


H"(M?) 
M1) 


(Mz) H’(M*,) 


0—> Hr--)"(By X M) i* 


| 


4.5. THEOREM. The homomorphism $j^*: H^r(M^p_\pi) \to H^r(M^*_p)$ is surjective.
The image under $j^*$ of an element $\alpha \in H^*(M^p_\pi)$ is zero if and only if


a) there exists an element $\beta \in H^*(B_\pi \times M)$ such that $i^*(\alpha) = \beta \cup i^*(\Delta_\pi)$
($\beta$ is then unique) and


b) $\psi r^*_0(\beta) = r^*(\alpha)$.


Conditions a) and b) are evidently necessary by the commutativity
of 4.4 and the fact that $\phi_0$ is given by the cup product with $i^*(\Delta_\pi)$.

Conversely, suppose there exists $\beta$ such that $i^*(\alpha) = \beta \cup i^*(\Delta_\pi)$; by
4.4, $\phi(\beta) - \alpha$ belongs to the kernel of $i^*$; now this kernel is contained
in the subgroup of elements of $\pi$-degree 0 of $H^*(M^p_\pi) = H^*(\pi, H^*(M)^p)$
(cf. 2.5); since $r^*$ is injective on this subgroup (cf. 2.2), the condition
$r^*(\phi(\beta) - \alpha) = 0$, equivalent to b), implies $\alpha = \phi(\beta)$, hence $j^*(\alpha) = 0$.

This theorem allows the explicit computation of $H^*(M^*_p)$, taking into account the
identifications $H^*(M^p_\pi) = H^*(\pi, H^*(M)^p)$, $H^*(B_\pi \times M) = H^*(\pi) \otimes H^*(M)$,
and the explicit expressions for $i^*$ (cf. 2.5), $r^*$ (2.2) and $i^*(\Delta_\pi)$ (cf. 3.6).

As for $\psi$, it is determined by the formula $\psi(y) = (y \otimes 1 \otimes \cdots \otimes 1) \cup \delta_p$,
where $\delta_p$ is the class dual to the diagonal $M$ in $M^p$ (one has identified $H^*(M^p)$
with $H^*(M)^p$); indeed, if $i_0$ is the injection of the diagonal $M$ into $M^p$,
$y = i^*_0(y \otimes 1 \otimes \cdots \otimes 1)$ and the formula follows from the multiplicative property
of $\psi$.


5. Conditions for the vanishing of the embedding classes $\Phi^f_p$.


5.1. Normal characteristic classes of a map. Let $f$ be a
continuous map of a manifold $V$ of dimension $n$ into a manifold $M$
of dimension $m$. Let $W = \sum W^i$ (resp. $W' = \sum W'^i$) be the total
Stiefel-Whitney class of $V$ (resp. $M$) and $Q = \sum Q^i$ (resp. $Q' = \sum Q'^i$) the
(total) characteristic class modulo $p$ of Wu of $V$ (resp. $M$) (cf. 3.5).

The total normal Stiefel-Whitney class of $f$ is the class $\bar W_f = \sum \bar W^i_f
= \bar W \cup f^*(W')$, where $\bar W$ is defined by $\bar W \cup W = 1$.

The normal characteristic class modulo $p$ of Wu of $f$ is the class
$\bar Q_f = \sum \bar Q^i_f = \bar Q \cup f^*(Q')$, where $\bar Q$ is defined by $\bar Q \cup Q = 1$.
Denote by $\delta_p$ (resp. $\delta'_p$) the class dual to the diagonal of $V^p$
(resp. $M^p$).
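5.2. THEOREM. The condition $\Phi^f_p = 0$ is equivalent to the conditions
a) $\bar W^k_f = 0$, $k > m - n$, for $p = 2$,
$\bar Q^k_f = 0$, $k > (m - n)/2$, for $p > 2$
and
b) $(\bar W^{m-n}_f \otimes 1) \cup \delta_2 = f^{2*}(\delta'_2)$ for $p = 2$,
$\sigma(\bar Q^{(m-n)/2}_f \otimes 1 \otimes \cdots \otimes 1) \cup \delta_p = f^{p*}(\delta'_p)$ for $p > 2$
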

ou o=1 ow (—1)*/? suivant que m et n sont tous deux pairs ow impair. 


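5.3. Proof. We take up again the notations of 3.7, 4.4 and 4.5. The
restriction of $\phi^p_\pi$ to $V_\pi$ is a map $\phi_\pi$ of $V_\pi$ into $M_\pi$. We have seen (3.7)
that $\Phi^f_p = j^*\phi^{p*}_\pi(\Delta^M_\pi)$. By 4.5, $\Phi^f_p = 0$ if and only if a)' there exists
$\beta \in H^*(B_\pi \times V)$ such that $i^*\phi^{p*}_\pi(\Delta^M_\pi) = \beta \cup i^*(\Delta^V_\pi)$ and b)' $\psi r^*_0(\beta)
= r^*(\phi^{p*}_\pi \Delta^M_\pi)$. These conditions a)' and b)' are equivalent respectively
to conditions a) and b) of 5.2. Let us show this for $p = 2$.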

En effet, = b*at* = La condition 
a)’ s’exprime sous la forme 


a)’ f* (W's) B U » wrt Wi, 


Multiplions les deux membres par }p"/W!. On obtient la condition 
pk, — qui équivaut W*;=0 pour k > m—n. 
On voit aussi que r*,(8) =composante de z-degré 0 de 


Dans le cas p > 2, on a r*9(B) =oQ;)"™/?, La condition b)’ est bien équiv- 
alente 4 b) d’aprés la fin de 4. 5. 


5.4. The immersion classes $\Psi^f_p$. It follows from this proof that
$\Psi^f_p = 0$ is equivalent to conditions a) of 5.2. Indeed $\Psi^f_p = j^*_0\phi^*_\pi i^*(\Delta^M_\pi)$.


5.5. COROLLARY (Wu). If $f$ is a map of $V$ into the Euclidean space
$R^m$, the condition $\Phi^f_p = 0$ is equivalent to


$\bar W^k = 0$ for $k \ge m - n$ in the case $p = 2$,


$\bar Q^k = 0$ for $k \ge (m - n)/2$ in the case $p > 2$.
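
The case $p = 2$ of the corollary can be tried on real projective space $RP^n$, an example chosen here for illustration and not taken from the text. The sketch assumes the standard total Stiefel-Whitney class $(1+a)^{n+1}$ in $Z_2[a]/(a^{n+1})$ and inverts it by recursion; the function names are hypothetical. The condition tested is the vanishing of $\Phi^f_2$, which is only an obstruction to embedding.

```python
from math import comb

def dual_stiefel_whitney_rp(n):
    """Coefficients of W-bar in H^*(RP^n; Z_2), where W-bar * (1 + a)^(n+1) = 1."""
    w = [comb(n + 1, i) % 2 for i in range(n + 1)]
    w_bar = [1] + [0] * n
    for k in range(1, n + 1):
        # the coefficient of a^k in W-bar * W must vanish (w[0] = 1, signs
        # are irrelevant modulo 2)
        w_bar[k] = sum(w[i] * w_bar[k - i] for i in range(1, k + 1)) % 2
    return w_bar

def criterion_holds(n, m):
    """Check W-bar^k = 0 for all k >= m - n (the case p = 2)."""
    w_bar = dual_stiefel_whitney_rp(n)
    return all(c == 0 for c in w_bar[max(m - n, 0):])

print(dual_stiefel_whitney_rp(4), criterion_holds(4, 7), criterion_holds(4, 8))
```

For $RP^4$ the dual class is $1 + a + a^2 + a^3$, so the condition fails for $m = 7$ and holds for $m = 8$; for $RP^2$ it fails for $m = 3$ and holds for $m = 4$.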


5.6. Le cas differentiable. On sait alors (cf. [6]) que la classe normale 




Q*, de f modulo p est un polynome dans les classes de Pontryagin normales 
H*i(V) de f réduites modulo p. Plus précisément, si est formelle- 
ment la i-eme fonction symétrique .élémentaire -,2°n), alors 
est la k-éme fonction symétrique élémentaire +, Un calcul 
immédiat montre que la condition Q*;—=0 pour k > (m—n) /2 est équivalente 
& (p*;)" =0 modulo p pour k > (m—n) /2 et qu’alors 
modulo p, ot h= (p—1)/2. 


INSTITUTE FOR ADVANCED STUDY. 


BIBLIOGRAPHY.


[1] A. Borel, "The Poincaré duality in generalized manifolds," Michigan Mathematical
Journal, vol. 4 (1957), pp. 227-239.

[2] A. Borel and J. Moore, "Homology and duality in generalized manifolds," Exp. II of
Seminar on transformation groups, Annals of Mathematics Studies No. 46,
Princeton, 1960.

[3] H. Cartan and S. Eilenberg, Homological algebra, Princeton, 1956.

[4] R. Godement, Topologie algébrique et théorie des faisceaux, Actualités Scientifiques
et Industrielles, Paris, 1958.

[5] A. Haefliger, "Sur les self-intersections des applications différentiables," Bulletin
de la Société Mathématique de France, vol. 87 (1959), pp. 351-359.

[6] F. Hirzebruch, "On Steenrod's reduced powers, the index of inertia and the Todd
genus," Proceedings of the National Academy of Sciences, USA, vol. 39
(1953), pp. 951-956.

[7] A. Shapiro, "Obstructions to the imbedding of a complex in a euclidean space.
I. The first obstruction," Annals of Mathematics, vol. 66 (1957), pp. 256-269.

[8] N. Steenrod, "Homology groups of symmetric groups and reduced power operations,"
Proceedings of the National Academy of Sciences, USA, vol. 39
(1953), pp. 213-223.

[9] N. Steenrod, "Existence and uniqueness of the cyclic reduced powers," to appear.

[10] R. Thom, "Espaces fibrés en sphères et carrés de Steenrod," Annales Scientifiques
de l'École Normale Supérieure, vol. 69 (1952), pp. 109-181.

[11] R. Thom, "Quelques propriétés globales des variétés différentiables," Commentarii
Mathematici Helvetici, vol. 28 (1954), pp. 17-86.

[12] W. T. Wu, "Classes caractéristiques et i-carrés d'une variété," Comptes Rendus
(Paris), vol. 230 (1950), pp. 508-511.

[13] W. T. Wu, "On the realization of complexes in euclidean spaces, I, II, III," Scientia
Sinica, vol. VII, No. 3 (1958), pp. 251-297, No. 4 (1958), pp. 365-387,
vol. VIII, No. 2 (1959), pp. 133-150.


FINITE GROUPS ADMITTING A FIXED-POINT-FREE
AUTOMORPHISM OF ORDER 4.*


By DANIEL GORENSTEIN and I. N. HERSTEIN.


Recently, in a remarkable piece of work [4, 5], John Thompson has proved
a result which implies as an immediate corollary the well-known Frobenius
conjecture, namely that a finite group admitting a fixed-point-free automorphism
(i.e., one leaving only the identity element fixed) of prime order must
be nilpotent. However, non-nilpotent groups are known which admit
fixed-point-free automorphisms of composite order. In all these cases one notices
that the groups in question are solvable. Although the sample is rather
restricted, it is not too unnatural to ask whether the condition that a
finite group admit such an automorphism is strong enough to force solvability
of the group. This question is related to another problem, which seems
equally difficult, which asks whether a finite group containing a cyclic subgroup
which is its own normalizer must be composite.

In the present paper we shall prove that a group $G$ possessing a fixed-point-free
automorphism $\phi$ of order 4 is solvable. Although many of the ideas
used carry over to the case in which $\phi$ has order $pq$, and especially $2q$, our
key lemmas use the fact that $\phi$ has order 4 in a crucial way.

The proof depends upon a theorem of Philip Hall which asserts that a
finite group $G$ is solvable if for every factorization of $o(G)$ into relatively
prime numbers $m$ and $n$, $G$ contains a subgroup of order $m$. We show
(Lemma 7) that a group $G$ which has a fixed-point-free automorphism of
order 4 satisfies the conditions of Hall’s theorem.

Once we know that G is solvable it is not difficult to prove that its 
commutator subgroup is nilpotent (Theorem 2). This fact was also observed 
by Thompson. 

Graham Higman has shown [3] that there is a bound to the class of a
$p$-group $P$ which possesses an automorphism $\phi$ of prime order $q$ without
fixed points. This does not carry over to automorphisms of composite order, for
at the end of the paper we give an example due to Thompson of a family of
$p$-groups of arbitrarily high class each of which admits a fixed-point-free
automorphism of order 4.


* Received July 8, 1960; Minor revision December 8, 1960. 




1. We begin by recalling a few well-known elementary results concerning
a finite group $G$ which admits a fixed-point-free automorphism $\phi$ of order $n$,
and in particular when $n = 4$. First of all, for any prime $p \mid o(G)$ there is a
unique $p$-Sylow subgroup $P$ of $G$ which is invariant under $\phi$. We shall call
$P$ the canonical $p$-Sylow subgroup of $G$ (with respect to $\phi$). Furthermore,
for any $x$ in $G$ we have the relation $x\,\phi(x)\,\phi^2(x) \cdots \phi^{n-1}(x) = 1$.

If $n = 4$, each orbit under $\phi$ except for that consisting of the identity
contains either 2 or 4 elements, hence $G$ is necessarily of odd order. The set
of elements of $G$ left fixed by $\phi^2$ is a $\phi$-invariant subgroup of $G$, which we
denote by $F$. If $F \neq 1$, the restriction of $\phi$ to $F$ is an automorphism of $F$ of
order 2 without non-trivial fixed elements. This implies that $F$ is Abelian
and that $\phi(f) = f^{-1}$ for all $f$ in $F$. Finally, we shall denote by $I$ the set of
all $h$ in $G$ for which $\phi^2(h) = h^{-1}$. It is worth observing that $I$ need not be
a subgroup of $G$.
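The facts just recalled can be checked by brute force on a small example. The sketch below is not taken from the paper: it assumes the toy group $G = \mathbb{Z}_5 \times \mathbb{Z}_3$, written additively, together with the hypothetical automorphism $\phi(a, b) = (2a, -b)$, which has order 4 and fixes only the identity. All function names are illustrative.

```python
from itertools import product

# Assumed toy example (not from the paper): G = Z_5 x Z_3, written additively.
G = list(product(range(5), range(3)))
IDENTITY = (0, 0)

def phi(g):
    """Hypothetical automorphism of order 4: multiply by 2 on Z_5, negate on Z_3."""
    a, b = g
    return (2 * a % 5, -b % 3)

def add(g, h):
    return ((g[0] + h[0]) % 5, (g[1] + h[1]) % 3)

def neg(g):
    return (-g[0] % 5, -g[1] % 3)

def phi_power(g, k):
    """Apply phi to g exactly k times."""
    for _ in range(k):
        g = phi(g)
    return g

def orbit_sum(g):
    """Additive form of x * phi(x) * phi^2(x) * phi^3(x)."""
    total = IDENTITY
    for k in range(4):
        total = add(total, phi_power(g, k))
    return total

fixed_points = [g for g in G if phi(g) == g]
F = [g for g in G if phi_power(g, 2) == g]        # fixed by phi^2
I = [g for g in G if phi_power(g, 2) == neg(g)]   # inverted by phi^2
orbit_sizes = sorted({len({phi_power(g, k) for k in range(4)}) for g in G})
```

In this Abelian example $F$ is the $\mathbb{Z}_3$ factor and $I$ is the $\mathbb{Z}_5$ factor, so $I$ happens to be a subgroup; the text's warning that $I$ need not be a subgroup concerns non-Abelian groups.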

Throughout the paper $G$ will denote a finite group having a fixed-point-free
automorphism $\phi$ of order 4, $F$ will denote the subgroup left elementwise
fixed by $\phi^2$ and $I$ the subset consisting of those elements of $G$ which are
mapped into their inverses by $\phi^2$.


Lemma 1. $G = FI = IF$.


Proof. If $z = x^{-1}\phi^2(x)$ for some $x$ in $G$, then $\phi^2(z) = \phi^2(x^{-1})\phi^4(x)
= \phi^2(x^{-1})x = z^{-1}$, whence $z \in I$. Furthermore, $x^{-1}\phi^2(x) = y^{-1}\phi^2(y)$ implies
that $\phi^2(xy^{-1}) = xy^{-1}$ and hence that $xy^{-1} \in F$. Thus $I$ contains at least $[G : F]$
elements.

To complete the proof, it will clearly suffice to show that distinct elements
of $I$ lie in distinct right (or left) cosets of $F$. If $h_2 = fh_1$, $h_1, h_2 \in I$, $f \in F$,
it follows by applying $\phi^2$ that $h_2^{-1} = fh_1^{-1}$. Combining this with the previous
relation gives $h_1^{-1}fh_1 = f^{-1}$. Since $G$ is of odd order, this forces $f = 1$ and
hence $h_1 = h_2$. Similarly, we show that $h_2 = h_1f$ implies $h_1 = h_2$.


Lemma 2. If $f_1$, $f_2$ in $F$ are conjugate in $G$, then $f_1 = f_2$.


Proof. Suppose $xf_1x^{-1} = f_2$. Since $F$ is Abelian, we may assume without
loss, in view of Lemma 1, that $x \in I$. Applying $\phi^2$ gives $x^{-1}f_1x = f_2$, whence
$x^2$ centralizes $f_1$. Since $G$ is of odd order, $x$ centralizes $f_1$, and consequently
$f_1 = f_2$.


As an immediate corollary, we obtain 
Lemma 3. Any subgroup of F is in the center of its normalizer. 


Lemma 4. If $h \in I$, $h$ commutes with $\phi(h)$.


Proof. This lemma follows at once from the relations $h\,\phi(h)\,\phi^2(h)\,\phi^3(h) = 1$
and $\phi^2(h) = h^{-1}$.

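Lemma 5. For any $p \mid o(G)$, $F$ normalizes the canonical $p$-Sylow subgroup
$P$ of $G$.


Proof. If $F \cap P = 1$, $\phi^2$ is an automorphism of $P$ of order 2 without
non-trivial fixed elements, whence $\phi^2(x) = x^{-1}$ for all $x \in P$. Thus $P \subseteq I$. Let
$f \in F$ and consider $P' = fPf^{-1}$. If $y = fxf^{-1}$, with $x \in P$, then
$\phi^2(y) = fx^{-1}f^{-1} = y^{-1}$, which implies that $P' \subseteq I$.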


Suppose $P' \neq P$; choose $y$ in $P'$ and not in $P$. The subgroup generated
by $y$ and its image under $\phi$ is $\phi$-invariant and, since $y \in I$, it follows from the
preceding lemma that this subgroup is a $p$-group. Let $P_1$ be a maximal
$\phi$-invariant $p$-group containing $y$. If $P_1$ were not a $p$-Sylow subgroup of $G$,
the unique $\phi$-invariant $p$-Sylow subgroup $P_2$ of $N_G(P_1)$ would have order
greater than $o(P_1)$ and would contain $P_1$ and consequently $y$. Since this
would contradict our choice of $P_1$, $P_1$ must be a $p$-Sylow subgroup of $G$.
Since $P$ is the only $\phi$-invariant $p$-Sylow subgroup of $G$, $P_1 = P$, which is
impossible since $y \in P_1$, $y \notin P$. We conclude that $P' = fPf^{-1} = P$. Since $f$
was arbitrary, $F \subseteq N_G(P)$.

Suppose, on the other hand, that $F \cap P \neq 1$. In this case we shall
prove the lemma by induction on $o(G)$. Since $F$ is Abelian, $F \cap P$ is a
$\phi$-invariant $p$-group which is normalized by $F$. If $P_1$ is a maximal $\phi$-invariant
$p$-group which is normalized by $F$, it follows first of all as in the preceding
paragraph that $P_1 \subseteq P$. Suppose $P_1 \subsetneq P$. We must have $N_G(P_1) = G$, for
otherwise by induction $F$ normalizes the unique $\phi$-invariant $p$-Sylow subgroup
$P_2$ of $N_G(P_1)$ and $o(P_2) > o(P_1)$. Thus $P_1 \lhd G$. Set $\bar G = G/P_1$ and
let $\bar\phi$ be the image of $\phi$ on $\bar G$. $\bar\phi$ has no non-trivial fixed elements and is of
order 2 or 4. If $\bar P$, $\bar F$ denote the images of $P$, $F$ in $\bar G$, it follows by induction
(or from the fact that $\bar G$ is Abelian in the case $\bar\phi^2 = 1$) that $\bar F \subseteq N_{\bar G}(\bar P)$.
Thus $F \subseteq N_G(P)$.


Lemma 6. If $A$, $B$ are two $\phi$-invariant subgroups of $G$ which are each
normalized by $F$, then $ABF$ is a $\phi$-invariant subgroup of $G$ of order dividing
$o(A)o(B)o(F)$.


Proof. Since $BF$ is a subgroup, $ABF$ will be a subgroup if $(BF)A
= A(BF)$. Since $F$ normalizes $A$, it will suffice to show that $BA \subseteq ABF$.

Since $A$ is $\phi$-invariant, it follows from Lemma 1 applied to $A$ that for
any $a$ in $A$, $a = a'f_1$, where $a' \in I \cap A$ and $f_1 \in F \cap A$. Similarly, for any
$b$ in $B$, $b = f_2b'$, $f_2 \in F \cap B$ and $b' \in I \cap B$. Clearly, $ba = f_2b'a'f_1 \in ABF$ if and
only if $b'a' \in ABF$.

Now $b'^{-1}a'^{-1} = fh$ for some $f$ in $F$, $h$ in $I$; applying $\phi^2$ gives $b'a' = fh^{-1}$.
Since $h^{-1} = a'b'f$ from the first relation, $b'a' = fa'b'f = a''b''f^2$, where $a'' \in A$,
$b'' \in B$. Thus $ABF$ is a subgroup as asserted. The remaining parts of the
lemma are immediate.


Lemma 7. Let $p_1, p_2, \dots, p_k$ be a set of primes dividing $o(G)$ and let
$P_1, P_2, \dots, P_k$ be the corresponding canonical Sylow subgroups of $G$. Then
$P_1P_2 \cdots P_k$ is a subgroup of $G$.


Proof. By induction on $k$ we may assume that $H = P_1P_2 \cdots P_{k-1}$ is a
subgroup of $G$. Clearly $H$ is $\phi$-invariant. By Lemma 5 $F \subseteq N_G(H)$ and
$F \subseteq N_G(P_k)$, so that $S = HP_kF$ is a $\phi$-invariant subgroup of $G$ by Lemma 6.
Since $o(S) \mid o(H)o(P_k)o(F)$, a $q$-Sylow subgroup $Q$ of $F$ for any prime $q \neq p_i$,
$i = 1, 2, \dots, k$, is a $q$-Sylow subgroup of $S$. By Lemma 3 $Q$ is in the center
of its normalizer in $S$, so that by a well-known theorem of Burnside $S$
possesses a normal $q$-complement $L_q$. Since $L_q$ consists of the elements of $S$
of order prime to $q$, $L_q$ contains $H$ and $P_k$. Repeating this argument for each
such prime $q \mid o(F)$, we readily conclude that

$$\bigcap_{q \mid o(F)} L_q = P_1P_2 \cdots P_k,$$

which, being an intersection of subgroups, is a subgroup.


Lemma 7 leads at once to our main result.


THEOREM 1. If $G$ is a finite group admitting an automorphism of
order 4 leaving only the identity element of $G$ fixed, then $G$ is solvable.


Proof. It follows from Lemma 7 that for any factorization of $o(G)$ into
the product of relatively prime numbers $m$ and $n$, $G$ contains a subgroup of
order $m$. By a theorem of Philip Hall ([2], Theorem 9.3.3, p. 144), this
implies that $G$ is solvable.
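To make the hypothesis of Hall's theorem concrete, the following sketch enumerates the factorizations of a group order into relatively prime factors, which are exactly the subgroup orders $m$ that Lemma 7 must supply. The function name `coprime_factorizations` is hypothetical, and the code checks nothing about any particular group; it only lists the orders involved.

```python
from math import gcd

def coprime_factorizations(order):
    """Return all pairs (m, n) with m * n == order and gcd(m, n) == 1."""
    return [(m, order // m)
            for m in range(1, order + 1)
            if order % m == 0 and gcd(m, order // m) == 1]

# Example: an odd order with three distinct prime divisors, 315 = 3^2 * 5 * 7.
pairs_315 = coprime_factorizations(315)
```

Each such $m$ is a product of full prime-power parts of $o(G)$, which is why a product of canonical Sylow subgroups has exactly the required order.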


2. We shall now examine the structure of $G$ more closely. For our
main result we need several lemmas.


Lemma 8. If $G = HM$, where $H$ is nilpotent, normal in $G$, $(o(H), o(M))
= 1$, $M$ is invariant under $\phi$ and $M \cap F = 1$, then $G = H \times M$.


Proof. Let $\Phi(H)$ be the Frattini subgroup of $H$ and set $\bar G = G/\Phi(H)
= \bar H\bar M$. Since $(o(H), o(M)) = 1$, it follows from the properties of the
Frattini subgroup that $\bar G = \bar H \times \bar M$ implies $G = H \times M$. Hence, without
loss, we may assume that $H$ is elementary Abelian. Since $\phi^2$ leaves only the
identity element of $M$ fixed, $M$ is Abelian. If either $H$ contains two disjoint
$\phi$-invariant subgroups normal in $G$ or $\phi$ does not act irreducibly on $M$, the
lemma follows easily by induction. Hence we may assume that $\phi$ acts
irreducibly on $M$ and no proper $\phi$-invariant subgroup of $H$ is normal in $G$. In
particular, this implies that $H$ is an elementary Abelian $p$-group for some
prime $p$.

The holomorph of $\phi$ and $M$ is represented irreducibly on $H$ regarded as
a vector space over the prime field $K_p$ with $p$ elements. Let $H^*$ be the
corresponding vector space over the algebraic closure $K_p^*$ of $K_p$. If $M$ does not
centralize $H$, it follows from Lemma 3.1 of [1] that with respect to a suitable
basis of $H^*$ the matrix of $\phi$ assumes the form

$$\begin{pmatrix} B_1 & & 0 \\ & \ddots & \\ 0 & & B_r \end{pmatrix},
\quad\text{where}\quad
B_i = \begin{pmatrix} 0 & 1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \\ b_i & 0 & 0 & 0 \end{pmatrix},$$

with $b_i \in K_p^*$. Since $\phi^4 = 1$, $B_i^4 = 1$ and hence $b_i = 1$ for all $i$. But this
means that 1 is a characteristic root of $\phi$ and hence that $\phi$ leaves some element
of $H$ other than the identity fixed. This contradiction forces $M$ to centralize
$H$, and consequently $G = H \times M$, as asserted.


Lemma 9. If $G = HM$, where $H$ is nilpotent, normal in $G$, $(o(H), o(M))
= 1$, $M$ is invariant under $\phi$ and $C_G(H) \subseteq H$, then $M \subseteq F$.


Proof. By Theorem 1 $G$ and hence $M$ is solvable. Let $K$ be a maximal
$\phi$-invariant normal subgroup of $M$. By induction applied to $HK$, $K \subseteq F$.
Let $\bar M = M/K$ and let $\bar\phi$ be the image of $\phi$ on $\bar M$. If $\bar\phi^2 = 1$ on $\bar M$, it follows
readily that $M \subseteq F$. Hence we may assume that $\bar\phi$ has order 4 on $\bar M$. Since
$\bar M$ is elementary Abelian and $\bar\phi$ acts irreducibly on $\bar M$, $\bar\phi^2(\bar y) = \bar y^{-1}$ for all
$\bar y$ in $\bar M$. Thus for all $y$ in $M$, $\phi^2(y) = y^{-1}z$ with $z \in K$. Now if $x \in K$, $yxy^{-1} = x'$
for some $x' \in K$. Applying $\phi^2$ gives $y^{-1}zxz^{-1}y = x'$. Since $K$ is Abelian we
easily conclude that $y^2$ and consequently $y$ centralizes $x$. Hence $K$ is in the
center of $M$.

As in the previous lemma we may assume without loss of generality that
$H$ is elementary Abelian. By Lemma 1 applied to $M$, $M = (F \cap M)(I \cap M)$,
and under our present assumptions $I \cap M \neq 1$. By induction we may suppose
that no $\phi$-invariant proper subgroup of $H$ is normal in $G$; and we shall then
derive a contradiction by showing that $I \cap M$ centralizes $H$.

Now $C_G(K)$ is $\phi$-invariant and contains $M$, since $K$ is in the center of $M$.
Since $H_1 = H \cap C_G(K)$, $N_G(H_1)$ contains $M$ and $H$, whence
$H_1 \lhd G$. Since $H_1$ is invariant under $\phi$, the minimality of $H$ implies that
either $H_1 = 1$ or $H_1 = H$.

Suppose first that $H_1 = 1$. Since $K \subseteq F$ and $F$ is Abelian, $H \cap F \subseteq C_G(K)$,
whence $H \cap F = 1$ and consequently $H \subseteq I$. If $y \in M \cap I$, $x \in H$, $yxy^{-1} = x'$
for some $x'$ in $H$. Applying $\phi^2$, we obtain $y^{-1}x^{-1}y = x'^{-1}$, which together with
the preceding relation implies that $x$ and $y$ commute. Thus $I \cap M \subseteq C_G(H)$
as asserted.

On the other hand, if $H_1 = H$, $K$ centralizes $H$, whence $K = 1$ and
$I \cap M = M$. Since $M \cap F = 1$, $G = H \times M$ by Lemma 8. Thus $I \cap M$
centralizes $H$, completing the proof.


Lemma 10. $G$ has $p$-length 1 for all $p \mid o(G)$.


Proof. Since $G$ is solvable by Theorem 1, the statement of the lemma is
meaningful. The proof will be by induction on $o(G)$. Let $M$ be the maximal
normal subgroup of $G$ of order prime to $p$, and assume first that $M \neq 1$.
$M$ is $\phi$-invariant since it is characteristic in $G$. Let $\bar\phi$ be the image of $\phi$
on $\bar G = G/M$. If $\bar\phi^2 = 1$ on $\bar G$, $\bar G$ is Abelian. If $\bar\phi$ has order 4 on $\bar G$, it
follows by induction that $\bar G$ has $p$-length 1 and hence that a $p$-Sylow subgroup
$\bar P$ of $\bar G$ is normal in $\bar G$. In either case we conclude that $G$ has $p$-length 1.
We may therefore suppose that $M = 1$.

Let $P_1$ be the maximal normal $p$-group of $G$, which is consequently $\phi$-invariant.
Let $P$ be the canonical $p$-Sylow subgroup of $G$ and $\bar P$ its image in $\bar G = G/P_1$.
If $\bar K$ is the maximal normal subgroup of $\bar G$ of order prime to $p$, it follows
by induction that the image of $\bar P$ in $\bar G/\bar K$ is normal in $\bar G/\bar K$, whence $\bar P\bar K \lhd \bar G$.
If $\bar P\bar K \neq \bar G$, its inverse image $G_1$ is a proper subgroup of $G$, and hence by
induction has $p$-length 1.

Since $P_1$ contains its own centralizer in $G$ ([2], Theorem 18.4.4, p. 332),
$G_1$ contains no non-trivial normal subgroups of order prime to $p$ and hence
$P \lhd G_1$. But then $\bar P \lhd \bar P\bar K$, and since $\bar P$ is characteristic in $\bar P\bar K$, we conclude
that $\bar P \lhd \bar G$, whence $P \lhd G$.

We may therefore assume that $\bar G = \bar P\bar K$. The inverse image $G_2$ of $\bar K$ is
of the form $P_1K$, where $K$ has order prime to $p$. Since $G_2$ is solvable, any two
subgroups of $G_2$ of order $o(K)$ are conjugate ([2], Theorem 9.3.1, p. 141).
One can now show by the same argument which proves the existence of
canonical $p$-Sylow subgroups that there exists a unique conjugate of $K$ in $G_2$
which is invariant under $\phi$. Without loss we may assume $K$ itself is
$\phi$-invariant. Since $C_{G_2}(P_1) \subseteq P_1$, the previous lemma implies that $K \subseteq F$.
But then by the argument of the first paragraph of that lemma $\bar K$ is in the
center of $\bar G$, whence $\bar P \lhd \bar G$ and $P \lhd G$.


THEOREM 2. If $G$ possesses an automorphism $\phi$ of order 4 leaving only
the identity element fixed, then the commutator subgroup of $G$ is nilpotent.


Proof. $G$ is solvable by Theorem 1. Assume first that $G$ contains two
minimal $\phi$-invariant normal subgroups $N_1$ and $N_2$. Since the image of $\phi$
on $G/N_1$ and $G/N_2$ has no non-trivial fixed elements, the commutator
subgroups $[G/N_i, G/N_i]$ of $G/N_i$, $i = 1, 2$, are nilpotent by induction. Let $H_i$
be the inverse image of $[G/N_i, G/N_i]$ in $G$ and set $H = H_1 \cap H_2$. Clearly,
$H \lhd G$ and $[G, G] \subseteq H$. Furthermore, if $x$ and $y$ are elements of relatively
prime order in $H$, their images in $G/N_i$, $i = 1, 2$, commute, and hence
$y^{-1}xyx^{-1} \in N_1 \cap N_2$. Since $N_1$ and $N_2$ are distinct minimal normal $\phi$-invariant
subgroups of $G$, $N_1 \cap N_2 = 1$, and consequently $x$, $y$ commute. Thus $H$ and
hence $[G, G]$ is nilpotent.

We may therefore suppose that $G$ contains a unique minimal normal
$\phi$-invariant subgroup $N_1$. $N_1$ is a $p$-group for some prime $p$ and $G$ contains
no non-trivial normal subgroups of order prime to $p$. Since $G$ has $p$-length 1
by Lemma 10, a $p$-Sylow subgroup $P$ of $G$ is normal in $G$. Now $C_G(P) \lhd G$
and $C_G(P) = Z(P) \times K$, where $K$ has order prime to $p$. Since $K$ is
characteristic in $C_G(P)$, $K$ is normal in $G$, whence $K = 1$ and $C_G(P) \subseteq P$.
Furthermore, $G = PM$ for some subgroup $M$ of $G$, and we may assume $M$ is invariant
under $\phi$. Since $(o(P), o(M)) = 1$, we can apply Lemma 9 to conclude that
$M \subseteq F$. Thus $M$ is Abelian, and consequently $[G, G] \subseteq P$ is nilpotent.


3. We conclude with an example of a family of $p$-groups of arbitrarily
high class each of which has a fixed-point-free automorphism of order 4. Let
$p$ be any prime such that $p \equiv 1 \pmod 4$ and let $P_1$ be an elementary Abelian
$p$-group of order $p^t$, where $t = p^d$, $d$ an arbitrary integer, and let $x_1, x_2, \dots, x_t$
be a basis for $P_1$. We construct an extension $P$ of $P_1$ by adjoining a new letter
$y$ satisfying the relations:

(*)    $y^t = 1$,  $y^{-1}x_iy = x_ix_{i+1}$ $(1 \le i \le t-1)$,  $y^{-1}x_ty = x_t$.

These relations define a $p$-group of order $tp^t$ and of class $t$.

Since $p \equiv 1 \pmod 4$, there is an integer $\alpha$ such that $\alpha^2 \equiv -1 \pmod p$.
We define an automorphism $\theta$ of $P$ by setting $\theta(y) = y^{-1}$, $\theta(x_t) = x_t^{\alpha}$. For
$\theta$ to be an automorphism, its value on $x_i$ must be such that

(**)    $\theta(y)^{-1}\theta(x_i)\theta(y) = \theta(x_i)\theta(x_{i+1})$.

Assume $\theta$ has been defined on $x_j$ for $j > i$ and that $\theta(x_j)$ is in the subgroup
generated by $x_j, x_{j+1}, \dots, x_t$. We shall show that $\theta(x_i)$ can be defined
satisfying (**) and subject to the restriction $\theta(x_i) = x_i^{a_i}x_{i+1}^{a_{i+1}} \cdots x_t^{a_t}$. It
follows at once from (**) that the exponents $a_i, a_{i+1}, \dots, a_t$ must satisfy a
system of linear congruences modulo $p$. Since these relations have a solution for
$a_i, a_{i+1}, \dots, a_{t-1}$ (for any choice of $a_t$), the automorphism $\theta$ exists.

Regarding $P_1$ as a vector space, it is easy to see that the matrix of
$\theta$ with respect to the basis $x_1, x_2, \dots, x_t$ has the form $\alpha D + N$, where
$D = \mathrm{diag}(1, -1, 1, -1, \dots, 1)$ and $N$ is a strictly triangular matrix. It
follows that the order of $\theta$ on $P_1$ is $4p^s$ for some $s \le d$. Setting $\phi = \theta^{p^s}$,
$\phi$ has order 4 on $P_1$ and since $\phi(y) = y^{-1}$, $\phi$ has order 4 on $P$. The
characteristic roots of $\phi$ are $\pm\alpha$, the same as those of $\theta$. Since $\alpha \neq \pm 1$, $\phi$ leaves
only the identity element of $P_1$ fixed. Since $\phi(y) = y^{-1}$, this implies that $\phi$
leaves only the identity element of $P$ fixed.
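The construction can be tried numerically in the smallest case. The sketch below is an illustration under stated assumptions, not the authors' computation: it takes $p = 5$, $d = 1$, $\alpha = 2$, represents conjugation by $y$ on $P_1$ as a single unipotent Jordan block `Y`, and builds one particular matrix `THETA` with $\Theta Y = Y^{-1}\Theta$ by choosing $\Theta e_1 = \alpha e_1$. The direction of conjugation, this choice of $\Theta e_1$, and all names are assumptions.

```python
PRIME, D_EXP, ALPHA = 5, 1, 2     # assumed toy parameters: alpha^2 = -1 (mod 5)
T = PRIME ** D_EXP                # t = p^d

def identity(n):
    return [[int(i == j) for j in range(n)] for i in range(n)]

def matmul(a, b):
    """Matrix product modulo PRIME."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) % PRIME for j in range(n)]
            for i in range(n)]

def matpow(a, e):
    result = identity(len(a))
    for _ in range(e):
        result = matmul(result, a)
    return result

def order(a):
    """Smallest k >= 1 with a^k equal to the identity (a must be invertible)."""
    k, power = 1, a
    while power != identity(len(a)):
        power, k = matmul(power, a), k + 1
    return k

# Column j of Y is the image of e_j: Y e_j = e_j + e_{j+1}, and Y e_t = e_t.
Y = identity(T)
for j in range(T - 1):
    Y[j + 1][j] = 1
Y_INV = matpow(Y, T - 1)          # valid because Y has order t
N = [[(Y_INV[i][j] - int(i == j)) % PRIME for j in range(T)] for i in range(T)]

# Theta e_1 = alpha e_1 and Theta e_{j+1} = N Theta e_j give Theta Y = Y^{-1} Theta.
columns = [[ALPHA] + [0] * (T - 1)]
for _ in range(T - 1):
    prev = columns[-1]
    columns.append([sum(N[i][k] * prev[k] for k in range(T)) % PRIME
                    for i in range(T)])
THETA = [[columns[j][i] for j in range(T)] for i in range(T)]

theta_order = order(THETA)
PHI = matpow(THETA, theta_order // 4)   # phi = theta^(p^s) when the order is 4 p^s
```

Because `PHI` is triangular with diagonal entries $\pm\alpha \neq 1$, the matrix $\phi - 1$ is invertible, which is the fixed-point-free property on $P_1$ in this toy case.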


CLARK UNIVERSITY, 
CORNELL UNIVERSITY. 


REFERENCES. 


[1] Daniel Gorenstein, “Finite groups which admit an automorphism with few orbits,”
Canadian Journal of Mathematics, vol. 12 (1960), pp. 73-100.

[2] Marshall Hall, The Theory of Groups, New York, 1959.

[3] Graham Higman, “Groups and rings having automorphisms without non-trivial fixed
elements,” Journal of the London Mathematical Society, vol. 32 (1957),
pp. 321-334.

[4] John G. Thompson, “Finite groups with fixed-point-free automorphisms of prime
order,” Proceedings of the National Academy of Sciences, vol. 45 (1959),
pp. 578-581.

[5] John G. Thompson, “Normal p-complements for finite groups,” Mathematische
Zeitschrift, vol. 72 (1960), pp. 332-354.


ON INDUCED REPRESENTATIONS.* 


By ROBERT J. BLATTNER.


1. Introduction. The purpose of this paper is threefold: to lay the
foundations of a theory of induced representations of (not necessarily separable)
locally compact groups, to prove a sharpened form of an intertwining
number theorem due to Bruhat, and to prove a disjointness theorem for
representations of Lie groups induced from compact subgroups. G. W. Mackey
in [10] developed a notion of induced representation when the group $G$
induced up to is separable and the inducing representation $L$ is in a separable
Hilbert space $\mathfrak{V}$. Because the construction rested on the choice of a
quasi-invariant measure in the relevant homogeneous space and because the existence
of such measures is problematical when $G$ is not separable, this definition is
not suitable for generalization to the non-separable case. The definition we
employ, which is equivalent to Mackey’s when $G$ and $\mathfrak{V}$ are separable, is a
modification of the one used by F. Bruhat in [3]. The chief novelty is the
way in which the Hilbert space structure is defined on the function space $\mathfrak{H}$
in which the induced representation $U^L$ operates, a way which owes much to
Mackey’s notion of intrinsic Hilbert space. Section 2 deals with the basic
properties of induced representations, concluding with the theorem on
induction in stages (Theorem 1).

In Sections 3 through 5 we are concerned with the problem, already
considered by Bruhat, of finding the intertwining number of two inductions
when $G$ is a Lie group. Our method is based on two facts: (1) any function
$f$ in $\mathfrak{H}$ which is in the domain $\mathfrak{H}_\infty$ of all the operators of the differential
representation $\partial U^L$ of the enveloping algebra $\mathfrak{E}$ of the Lie algebra of $G$ is
essentially continuous; (2) if $X \in \mathfrak{E}$ is elliptic as a left invariant differential
operator on $G$ and has sufficiently high order, then $f(e)$ may be estimated
in terms of $\partial U^L(X)f$ when $f \in \mathfrak{H}_\infty$. Our estimate of the intertwining number
(Theorem 3) is in terms of the dimension of a certain space of distributions,
the order of which is usually much lower than that needed in Bruhat’s theorem.
This would seem to result from our use of unitary representations throughout
our discussion and our replacement of Schwartz’s kernel theorem by the facts
on elliptic operators mentioned above. The importance in other connections

* Received July 19, 1960. 


of the theory of elliptic operators for Lie group representation theory has 
been established in recent papers by Stinespring ([15]) and Nelson and 
Stinespring ([12]). In Section 6 we prove a disjointness criterion for induc- 
tions to Lie groups of finite dimensional representations of compact subgroups 
(Theorem 4). This is accomplished by applying direct integral techniques 
to the results of Section 5. The paper concludes with an example in Section 7. 


Notation. If $f$ is a numerical function on a set $S$, $\|f\|_S = \mathrm{LUB}[\,|f(s)| :
s \in S\,]$. If $O$ is an open subset of a topological space and if $\mathfrak{X}$ is a topological
linear space, $C(O; \mathfrak{X})$ denotes the class of all continuous functions from
$O$ to $\mathfrak{X}$ and $C_0(O; \mathfrak{X})$ denotes the subclass of $C(O; \mathfrak{X})$ consisting of
functions with compact support. A superscript $\infty$ (resp. $+$) indicates restriction
to indefinitely differentiable (resp. non-negative) functions. If $\mathfrak{Y}$ is
another topological linear space, $\mathfrak{L}(\mathfrak{X}; \mathfrak{Y})$ is the space of all continuous
linear maps of $\mathfrak{X}$ into $\mathfrak{Y}$ equipped with the topology of bounded convergence.

The author wishes to acknowledge his indebtedness to R. S. Phillips for 
several conversations on partial differential operators. 


2. Induced representations. Let $G$ be a locally compact group, $\Gamma$ a
closed subgroup of $G$, and $L$ a unitary representation of $\Gamma$ on the Hilbert
space $\mathfrak{V}$. Let right Haar measure be chosen in $G$ and $\Gamma$, and let their
respective modular functions be $\Delta$ and $\delta$. Let $\mathfrak{F}^*$ be the set of all functions
$f$ from $G$ to $\mathfrak{V}$ such that: (1) $f(\cdot)$ is Bourbaki measurable (see [2], p. 180);
(2) $f(\xi x) = \Delta(\xi)^{-1/2}\delta(\xi)^{1/2}L_\xi f(x)$ whenever $\xi \in \Gamma$ and $x \in G$; (3) $\|f(\cdot)\|^2$ is
locally integrable. Let $M = G/\Gamma$ (right coset space) and let $\pi$ be the
canonical projection of $G$ on $M$. As in Lemma 1.5 of [10], $\|f(\cdot)\|^2$ defines
a positive Radon measure $\mu_f$ on $M$ via the equation

$$\int_G g(x)\,\|f(x)\|^2\,dx = \int_M (\tau g)(p)\,d\mu_f(p),$$

where $g \in C_0(G)$ and $(\tau g)(\pi(x)) = \int_\Gamma g(\xi x)\,d\xi$. Set $\|f\| = \mu_f(M)^{1/2}$ and
$\mathfrak{F} = [f \in \mathfrak{F}^* : \|f\| < \infty]$. One easily sees that $\mathfrak{F}^*$ and $\mathfrak{F}$ are linear spaces.

If $f, g \in \mathfrak{F}$, then $(f(\cdot), g(\cdot))$ is Bourbaki measurable ([2], Proposition
10, p. 193) and $(f(\cdot), g(\cdot))$ defines a finite complex valued Radon measure
on $M$, call it $\mu_{f,g}$, in the same way that $\|f(\cdot)\|^2$ defines $\mu_f$. We set $(f, g)
= \mu_{f,g}(M)$. $(\cdot, \cdot)$ is a positive semi-definite Hermitian form on $\mathfrak{F}$ and
$\|f\| = (f, f)^{1/2}$. Clearly $\|f\| = 0$ if and only if $f(\cdot) = 0$ locally almost everywhere
(l.a.e.). Thus, if we set $\mathfrak{H} = \mathfrak{F}/[f \in \mathfrak{F} : f(\cdot) = 0 \text{ l.a.e.}]$, we may
transport $(\cdot, \cdot)$ to $\mathfrak{H}$ and $\mathfrak{H}$ is then a pre-Hilbert space. (In the sequel, we
shall often willfully confuse $\mathfrak{F}$ and $\mathfrak{H}$.)

In order to show completeness, we need the following estimate, which is 
also useful in other connections. 


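Lemma 1. For each compact subset $K$ of $G$, there is a constant $A_K$ such
that, for all $f \in \mathfrak{F}$,

$$\int_K \|f(x)\|\,dx \le A_K\,\|f\|.$$


Proof. Choose $g \in C_0^+(G)$ such that $g = 1$ on $K$. Then

$$\int_K \|f(x)\|^2\,dx \le \int_G g(x)\,\|f(x)\|^2\,dx = \int_M (\tau g)(p)\,d\mu_f(p) \le \|\tau g\|_M\,\|f\|^2.$$

Set $A_K = \bigl(\|\tau g\|_M \int_K dx\bigr)^{1/2}$ and apply the Schwarz inequality.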


Proposition 1. $\mathfrak{H}$ is complete.


Proof (after Riesz-Fischer). Let $\{f_n\}$ be a Cauchy sequence in $\mathfrak{F}$. As
usual it suffices to show $\{f_n\}$ has a limit in $\mathfrak{F}$ under the added assumption
that $\|f_n - f_{n+1}\| \le 2^{-n}$. First we show that for locally almost all $x \in G$,
$\lim f_n(x)$ exists in $\mathfrak{V}$. In fact, let $K$ be a compact subset of $G$. Lemma 1
tells us that $\int_K \|f_n(x) - f_{n+1}(x)\|\,dx \le 2^{-n}A_K$, whence

$$\int_K \sum_{n=1}^{\infty} \|f_n(x) - f_{n+1}(x)\|\,dx \le A_K.$$

Therefore for locally almost all $x \in K$, $\{f_n(x)\}$ is Cauchy in $\mathfrak{V}$. Set $f(x)
= \lim f_n(x)$ or 0 according as this limit exists or not.

Clearly $f$ satisfies properties (1) and (2) for $\mathfrak{F}^*$. We must show that $f$
satisfies (3), that $\|f\| < \infty$, and that $\|f_n - f\| \to 0$. Let $g \in C_0^+(G)$.

Iterating the parallelogram identity for $\mathfrak{V}$, we see that


= 24 ll — Fras ll? < || 7g 


By Fatou’s lemma, —f(2) 2g (ae) de <2"? || cg lla. Letting K 
| be compact in @ and setting g—1 on K, we see that | f,(-) —f(-)||? is 




integrable on K. Hence f€ F*. Letting g be arbitrary in Cy*(G), we see 
that || f-—f || =2-*r?, whence f€ ¥. Finally, || f-—f || 0. 

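The following is a restricted form of Bruhat’s version ([3], §4) of a map
first introduced by Mackey ([10], §3). Let $f \in C_0(G)$ and $v \in \mathfrak{V}$. Form

$$\epsilon(f, v)(x) = \int_\Gamma \delta(\xi)^{-1/2}\Delta(\xi)^{1/2} f(\xi x)\,L_\xi^{-1}v\,d\xi, \qquad x \in G.$$

The support of $\epsilon(f, v)$ is contained in $\Gamma K$ if the support of $f$ is $K$. Letting
$\mathfrak{F}_0$ denote the subspace of $\mathfrak{F}^*$ consisting of functions which are continuous
and have compact support modulo $\Gamma$, we have $\mathfrak{F}_0 \subseteq \mathfrak{F}$. Clearly $\epsilon(f, v) \in \mathfrak{F}_0$,
and $\epsilon$ is bilinear. Two important facts about $\epsilon$ are summarized in the following
lemma (cf. [10], Lemmas 3.1 and 3.5).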

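Lemma 2. (a) If $K$ is the support of $f$, $\|\epsilon(f, v)\| \le A_K\,\|f\|_G\,\|v\|$.
(b) If $D$ is total in $\mathfrak{V}$, then $\epsilon(C_0(G) \times D)$ is total in $\mathfrak{H}$.


Proof. (a) Choose $h \in C_0(G)$ so that $\tau h = 1$ on $\pi(K)$. Let $g \in \mathfrak{F}$. Then

$$(\epsilon(f, v), g) = \int_G h(x)\,(\epsilon(f, v)(x), g(x))\,dx
= \int_G \int_\Gamma h(x)\,\delta(\xi)^{-1/2}\Delta(\xi)^{1/2} f(\xi x)\,(L_\xi^{-1}v, g(x))\,d\xi\,dx.$$

Using the Fubini theorem and the invariance properties of Haar measure,
this becomes $\int_G \int_\Gamma h(\xi x) f(x)\,(v, g(x))\,d\xi\,dx = \int_G f(x)\,(v, g(x))\,dx$ by the
choice of $h$. Lemma 1 and the Schwarz inequality give $|(\epsilon(f, v), g)|
\le A_K\,\|f\|_G\,\|v\|\,\|g\|$ and our result follows.

(b) Suppose $g \in \mathfrak{F}$ and $(\epsilon(f, v), g) = 0$ for all $f \in C_0(G)$, $v \in D$.
The calculation in (a) shows that for each $v \in D$, $(v, g(\cdot)) = 0$ l.a.e. Using
[2], Proposition 10, p. 193, we see that $g(\cdot) = 0$ l.a.e.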

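Remark. If $G$ is a Lie group, the above proof shows that $\epsilon(C_0^\infty(G) \times \mathfrak{V})$
is total in $\mathfrak{H}$.

For any function $f$ on $G$, define $R_yf$ by $(R_yf)(x) = f(xy)$. Clearly $R_y$
carries $\mathfrak{F}^*$ into $\mathfrak{F}^*$. We assert that $\|R_yf\| = \|f\|$ for $y \in G$, $f \in \mathfrak{F}^*$. In fact,
let $g \in C_0(G)$. Then

$$\int_M (\tau g)(p)\,d\mu_{R_yf}(p) = \int_G g(x)\,\|f(xy)\|^2\,dx = \int_G g(xy^{-1})\,\|f(x)\|^2\,dx
= \int_M (R'_{y^{-1}}\tau g)(p)\,d\mu_f(p),$$

where $(R'_yh)(p) = h(py)$ for any function $h$ on $M$ and any $y \in G$. Taking
the sup over all $g$ such that $0 \le \tau g \le 1$, we have our assertion.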

For $y \in G$, let $U^L_y$ be the unitary map of $\mathfrak{H}$ onto itself defined by $R_y$.

PROPOSITION 2. The map $U^L$ sending $y \to U^L_y$ is a continuous unitary
representation of $G$ in $\mathfrak{H}$.

Proof. The representation property is clear. For continuity, let $f, g \in \mathfrak{F}_0$
and let $N$ be a compact neighborhood of $e$ in $G$. The supports of all the
measures $\mu_{R_yf,g}$, $y \in N$, are contained in some common compact subset $K$ of $M$.
Choose $h \in C_0(G)$ so that $\tau h = 1$ on $K$. Then, for $y \in N$,

$$(U^L_yf, g) = \int_G h(x)\,(f(xy), g(x))\,dx,$$

which is a continuous function of $y$ by standard theorems on integration. Our
result follows because $\mathfrak{F}_0$ is dense in $\mathfrak{F}$ by Lemma 2.

$U^L$ is called the representation of $G$ induced by $L$. If it is not clear
from the context what group is being induced up to, we shall write ${}_GU^L$.

Remark. In case $G$ and $\mathfrak{V}$ are separable, our definition of $U^L$ and
Mackey’s ([10]) are equivalent. In fact, let $\nu$ be a quasi-invariant measure
on $M$ defined by a $\rho$-function $\rho$ ([10], Lemma 1.4). If $f \in \mathfrak{F}_0$, it is readily
seen that $\rho^{-1/2}f \in$ “$\mathfrak{H}$” ([10], §2) and that the map $f \to \rho^{-1/2}f$ is isometric. The
image of $\mathfrak{F}_0$ under this map is easily seen to satisfy the conditions of Lemma
3.3 in [10]. Because of that lemma and our Lemma 2, the map extends to a
unitary map of $\mathfrak{H}$ onto “$\mathfrak{H}$”, which sets up the required equivalence. We
note that our definition of induced representation is fixed once Haar measure
has been fixed in $G$ and $\Gamma$. Moreover, for different choices of Haar measure
we obtain the same $\mathfrak{H}$ with the norm changed by a multiplicative constant.
Thus the problem in [10] of showing that all ways of defining the induced
representation are equivalent is trivialized.

We end this section by proving the theorem on induction in stages: 

THEOREM 1. Let $\Gamma_1$ and $\Gamma_2$ be closed subgroups of $G$ with $\Gamma_1 \subseteq \Gamma_2$. Let
$L$ be a unitary representation of $\Gamma_1$ on $\mathfrak{V}$ and denote the inductions of $L$ to
$\Gamma_2$ and $G$ by $M$ and $U$ respectively. Then $U$ is unitarily equivalent to $U^M$.


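Proof. Let $\delta_1$, $\delta_2$, and $\Delta$ be the modular functions for $\Gamma_1$, $\Gamma_2$, and $G$
respectively. Let $\mathfrak{F}^{(1)}$, $\mathfrak{F}^{(2)}$, and $\mathfrak{F}$ denote the spaces for the inductions from
$\Gamma_1$ to $\Gamma_2$, $\Gamma_2$ to $G$, and $\Gamma_1$ to $G$ respectively, corresponding to the space $\mathfrak{F}$ in
our construction. Let $f \in \mathfrak{F}_0$ with support in $\Gamma_1K$, $K$ compact. For $\eta \in \Gamma_2$,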


TE G, set f(y, ©) (nr). Let be fixed. Then 
f (&, 2) = 5, (£)#32(€) #Lef(n, 2), €€Ti,7€ To. 




Moreover /(-,2) is continuous with support in Thus /(-,2) 
is a member of ¥“),, which we denote by f(z). Now 


fxr) 82 (£)3A (nf, 


for 7, £€T., G, so that (fx) 4M f(x). The support of f(-) 
is in I,K. To prove continuity, let N be a compact neighborhood of e in G 


and choose h € C,(G) so that f h(éx) dé =1 on Then f h(&nx) dé 
=1 for »€T,(KNz'nT,). Hence 


whenever yz€ N, and the continuity of f follows from the uniform con- 
tinuity of f on compact sets. Thus /(-) is a member of #@), which we denote 


by 7. Now || f(z) |?= 82(9) 7A (yx) || ||? dy. Choose € Co(G) 
so that J. k(nx)dy=1o0nT.K. Then, using the Fubini theorem and the 


invariance of Haar measure, 

G 

Tn 

G 


by the choice of h and k. Thus f—f is an isometry of F¥, into F@. 
We next assert that the image of ¥, in F®), is dense. Let g€ C,(T.), 


h€C(G), v€Y, and set = (£4) h (Ea) dt € Co(G). 
Then 
(2) = f° J. 8, BA (Lé) 4g (£4) (Léa) dtdé so that 


(€) 452 (2E*) (£) 89 (fa) dgdé 
— J, (6A Ler aa, 




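Letting $\epsilon_1$, $\epsilon_2$ be the $\epsilon$-maps for the inductions from $\Gamma_1$ to $\Gamma_2$, $\Gamma_2$ to $G$
respectively, we see that $\epsilon(k, v)^{\sim} = \epsilon_2(h, \epsilon_1(g, v))$. By Lemma 2, the set
$\epsilon_2(C_0(G) \times \epsilon_1(C_0(\Gamma_2) \times \mathfrak{V}))$ is total in $\mathfrak{F}^{(2)}$, proving our assertion. Finally,
we see that the map $f \to \tilde f$ extends to a unitary map of $\mathfrak{F}$ onto $\mathfrak{F}^{(2)}$ which
sets up the desired equivalence.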
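For finite groups, where all modular functions are trivial, induction in stages can be checked at the level of characters. The sketch below is only a finite analogue of Theorem 1, not the construction of this section: it assumes $G = S_3$, $\Gamma_2 = A_3$, $\Gamma_1 = \{e\}$ with the trivial representation, and uses the classical Frobenius formula for the induced character. All names are illustrative.

```python
from fractions import Fraction
from itertools import permutations

def compose(p, q):
    """Return the permutation p∘q (apply q first, then p)."""
    return tuple(p[q[i]] for i in range(len(q)))

def inverse(p):
    inv = [0] * len(p)
    for i, image in enumerate(p):
        inv[image] = i
    return tuple(inv)

def sign(p):
    """Sign of a permutation, computed by counting inversions."""
    inversions = sum(1 for i in range(len(p))
                     for j in range(i + 1, len(p)) if p[i] > p[j])
    return -1 if inversions % 2 else 1

def induce(char, subgroup, group):
    """Frobenius formula for the character of `group` induced from `subgroup`."""
    members = set(subgroup)
    induced = {}
    for g in group:
        conjugates = (compose(compose(x, g), inverse(x)) for x in group)
        total = sum(char[c] for c in conjugates if c in members)
        induced[g] = Fraction(total, len(subgroup))
    return induced

E = (0, 1, 2)
S3 = list(permutations(range(3)))
A3 = [g for g in S3 if sign(g) == 1]

trivial = {E: 1}
step1 = induce(trivial, [E], A3)     # Gamma_1 -> Gamma_2
step2 = induce(step1, A3, S3)        # Gamma_2 -> G
direct = induce(trivial, [E], S3)    # Gamma_1 -> G in one step
```

Both routes give the regular character of $S_3$, which for finite groups determines the representation up to equivalence.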


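3. The representation $\partial V$. Let $G$ be a Lie group and let $V$ be a
unitary representation of $G$ on the Hilbert space $\mathfrak{K}$. Let $X \in \mathfrak{g}$, the left
invariant Lie algebra of $G$, and let $x(\cdot)$ be the one-parameter subgroup of
$G$ such that $(Xf)(y) = \frac{d}{dt} f(y\,x(t))\big|_{t=0}$ for all $f \in C^\infty(G)$. $dV(X)$ will
denote the skew-adjoint infinitesimal generator of the one-parameter unitary
group $V_{x(\cdot)}$ in $\mathfrak{K}$. We will denote by $\mathfrak{K}_\infty$ the largest submanifold of $\mathfrak{K}$
contained in $\bigcap[\mathrm{dom}(dV(X)) : X \in \mathfrak{g}]$ and invariant under $dV(\mathfrak{g})$. $\mathfrak{K}_\infty$ is
invariant under $V$ because $dV(X)V_y = V_y\,dV((\mathrm{ad}\,y^{-1})X)$, $X \in \mathfrak{g}$, $y \in G$.

Let $\partial V$ be the restriction of $dV$ to $\mathfrak{K}_\infty$.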


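Lemma 3. $\mathfrak{K}_\infty$ is dense in $\mathfrak{K}$ and $\partial V$ is a representation of $\mathfrak{g}$ in the Lie
algebra of all skew symmetric linear transformations of $\mathfrak{K}_\infty$ into itself.


Proof. Let $D$ be the linear space spanned by all vectors of the form

$$\int_G f(x)\,V_xv\,dx \quad\text{for } f \in C_0^\infty(G),\ v \in \mathfrak{K}.$$

Gårding has shown ([5]) that $D \subseteq \bigcap[\mathrm{dom}(dV(X)) : X \in \mathfrak{g}]$, is dense in $\mathfrak{K}$,
and is invariant under all $dV(X)$, and that the restriction $\delta V$ of $dV$ to $D$
is a representation of $\mathfrak{g}$. Therefore $D \subseteq \mathfrak{K}_\infty$ and $\mathfrak{K}_\infty$ is dense in $\mathfrak{K}$. Segal
has shown ([13]) that $\delta V(X)$ is essentially skew-adjoint, $X \in \mathfrak{g}$.

Let $X, Y \in \mathfrak{g}$. Then $\partial V(X) + \partial V(Y) \supseteq \delta V(X + Y)$ and $[\partial V(X), \partial V(Y)]
\supseteq \delta V([X, Y])$. Since the left hand members are skew-symmetric, they must
be contained in the closures of the respective right hand members. Restricting
these closures to $\mathfrak{K}_\infty$, we have our result.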

Let $\mathfrak{E}$ be the enveloping algebra of the complexification of $\mathfrak{g}$. Lemma 3
allows us to extend $\partial V$ to be a representation of $\mathfrak{E}$ in the algebra of
endomorphisms of $\mathfrak{K}_\infty$. If $+$ is the unique involutory conjugate linear
anti-automorphism of $\mathfrak{E}$ such that $X^+ = -X$ for $X \in \mathfrak{g}$, it is easy to see that
$\partial V(X^+) \subseteq \partial V(X)^*$ for $X \in \mathfrak{E}$ (cf. [12]).


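Lemma 4. Let $V^1$ and $V^2$ be unitary representations of $G$ on $\mathfrak{K}^1$ and $\mathfrak{K}^2$
respectively. Let $A \in \mathcal{R}(V^1, V^2)$, the space of operators intertwining $V^1$ and
$V^2$ ([10], §8). Then $A\mathfrak{K}^1_\infty \subseteq \mathfrak{K}^2_\infty$. Moreover $A\,\partial V^1(X) \subseteq \partial V^2(X)A$ for
$X \in \mathfrak{E}$.


Proof. Let $X \in \mathfrak{g}$ and let $x(\cdot)$ be the one-parameter subgroup of $G$
underlying $X$. We have $AV^1_{x(t)} = V^2_{x(t)}A$, $t \in R$. Hence, if $v \in \mathrm{dom}(dV^1(X))$,
$Av \in \mathrm{dom}(dV^2(X))$ and $dV^2(X)Av = A\,dV^1(X)v$; i.e., $dV^2(X)A \supseteq A\,dV^1(X)$.
We conclude that $A\mathfrak{K}^1_\infty \subseteq \bigcap[\mathrm{dom}(dV^2(X)) : X \in \mathfrak{g}]$ and that $A\mathfrak{K}^1_\infty$ is
invariant under $dV^2(\mathfrak{g})$. Therefore $A\mathfrak{K}^1_\infty \subseteq \mathfrak{K}^2_\infty$. It follows that $A\,\partial V^1(X)
\subseteq \partial V^2(X)A$ for $X \in \mathfrak{g}$. The same statement for $X \in \mathfrak{E}$ is immediate.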

In the next two lemmas, we return to the situation of Section 2. G is 
a Lie group. We shall denote by #o* the space of all infinitely strongly 
differentiable functions in Fo. 


LEMMA 5. $\mathcal{F}_0^\infty \subset \mathcal{F}^L_\infty$. Moreover $\partial U^L(X)f = Xf$ for all $X \in \mathcal{E}$, $f \in \mathcal{F}_0^\infty$.


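Proof. Let $X \in \mathfrak{g}$ and let $x(\cdot)$ be the one-parameter subgroup of $G$ underlying $X$. Let $f \in \mathcal{F}_0^\infty$. There is a compact subset $K$ of $M$ whose inverse image in $G$ contains the supports of $Xf$ and $R_{x(t)}f$, $|t| \le 1$. Choose $h \in C_0^\infty(G)$ such that $\tau h = 1$ on $K$. Then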


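for $0 < |t| \le 1$. But

$$\| t^{-1}(f(y\,x(t)) - f(y)) - (Xf)(y) \|^2 \le 2\,\| t^{-1}(f(y\,x(t)) - f(y)) \|^2 + 2\,\| (Xf)(y) \|^2$$
$$\le 2\,\mathrm{LUB}[\| (Xf)(y\,x(s)) \|^2 : |s| \le 1] + 2\,\| (Xf)(y) \|^2 \le 4\,\mathrm{LUB}[\| (Xf)(y\,x(s)) \|^2 : |s| \le 1].$$

The bounded convergence theorem applies to show that $dU^L(X)f = Xf$. From this the lemma follows exactly as in Lemma 4.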


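LEMMA 6. Let $f \in C_0^\infty(G)$, $v \in \mathcal{V}$. Then $\varepsilon(f, v) \in \mathcal{F}^L_\infty$. Moreover $\partial U^L(X)\varepsilon(f, v) = \varepsilon(Xf, v)$ for all $X \in \mathcal{E}$.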


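Proof. Let $X \in \mathfrak{g}$ and let $x(\cdot)$ be the one-parameter subgroup of $G$ underlying $X$. There is a compact subset $K$ of $G$ containing the supports of $Xf$ and $R_{x(t)}f$, $|t| \le 1$. Then

$$\| t^{-1}(U^L_{x(t)}\varepsilon(f, v) - \varepsilon(f, v)) - \varepsilon(Xf, v) \| = \| \varepsilon(t^{-1}(R_{x(t)}f - f) - Xf,\ v) \| \le A_K\, \| t^{-1}(R_{x(t)}f - f) - Xf \|_\infty\, \| v \|$$

if $0 < |t| \le 1$, using Lemma 2 and the fact that $U^L_y \circ \varepsilon = \varepsilon \circ (R_y \times I)$, $y \in G$. We conclude that $dU^L(X)\varepsilon(f, v) = \varepsilon(Xf, v)$, and the lemma follows as in Lemma 4.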


ON INDUCED REPRESENTATIONS. 87 


COROLLARY. Let $f \in C_0^\infty(G)$, $X \in \mathcal{E}$. Then $v \to \partial U^L(X)\varepsilon(f, v)$ is a bounded linear map from $\mathcal{V}$ to $\mathcal{F}^L$.


Proof. From Lemmas 2 and 6. 


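4. Certain elliptic operators. Let $\mathcal{V}_i$ be a finite dimensional Hilbert space and $T_i = (X_i, \mathcal{S}_i, m_i)$ be a measure space, $i = 1, 2$ ($m_i$ is a measure defined on the $\sigma$-field $\mathcal{S}_i$ of subsets of $X_i$). $\mathcal{W}_i = L_2(T_i; \mathcal{V}_i)$ will then be the Hilbert space of all measurable functions $f$ from $X_i$ to $\mathcal{V}_i$ such that $\|f\| = (\int \|f(x)\|^2\, dm_i(x))^{1/2} < \infty$ (modulo those for which $\|f\| = 0$). $\mathcal{L}(\mathcal{V}_2; \mathcal{V}_1)$ is also a Hilbert space under the norm $\|T\| = \mathrm{trace}(T^*T)^{1/2}$ so that we can form the Hilbert space $\mathcal{W}_{12} = L_2(T_1 \times T_2; \mathcal{L}(\mathcal{V}_2; \mathcal{V}_1))$. If $\sigma \in \mathcal{W}_{12}$ and $f \in \mathcal{W}_2$, then $\sigma(\cdot, f) = \int \sigma(\cdot, y) f(y)\, dm_2(y)$ is in $\mathcal{W}_1$. Therefore the map $f \to \sigma(\cdot, f)$ is in $\mathcal{L}(\mathcal{W}_2; \mathcal{W}_1)$. It is easily seen that its (operator) norm is $\le \|\sigma\|$. Letting $\mathcal{W}_{21} = L_2(T_2 \times T_1; \mathcal{L}(\mathcal{V}_1; \mathcal{V}_2))$, we easily see that the adjoint of $f \to \sigma(\cdot, f)$, $\sigma \in \mathcal{W}_{12}$, is the map $g \to \sigma^*(\cdot, g)$ where $\sigma^* \in \mathcal{W}_{21}$ is defined by $\sigma^*(x, y) = \sigma(y, x)^*$ for $(x, y) \in X_2 \times X_1$. The next lemma deals with certain special kernels $\sigma$.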

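LEMMA 7. Let $X_i$ be an open subset of $R^n$, $\mathcal{S}_i$ the Borel field of $X_i$, and $m_i$ Lebesgue measure on $X_i$, $i = 1, 2$. Let $\sigma \in \mathcal{W}_{12}$ have the following properties: (1) there is an $h \in L_2(R^n)$ such that $\|\sigma(x, y)\| \le h(x - y)$ for $(x, y) \in X_1 \times X_2$; (2) there is a null set $N \subset R^n$ such that $\sigma$ is continuous on $[(x, y) \in X_1 \times X_2 : x - y \notin N]$. Then, for all $f \in \mathcal{W}_2$, $\sigma(\cdot, f) \in C(X_1; \mathcal{V}_1)$ and $\|\sigma(\cdot, f)\|_\infty \le \|h\|_2 \|f\|$.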


Proof. 


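Now if $f \in C_0(X_2; \mathcal{V}_2)$, we have $\sigma(x, f) = \int \sigma(x, x+y) f(x+y)\, dy$. Except for $-y \in N$, the function $x \to \sigma(x, x+y) f(x+y)$ is continuous from $X_1$ to $\mathcal{V}_1$. Moreover, if $B$ is a bounded open subset of $X_1$ and $S$ is the support of $f$, then $\|\sigma(x, x+y) f(x+y)\|$ is dominated, for all $x \in B$, by a fixed function of $y$ in $L_1(R^n)$. Hence the bounded convergence theorem applies to give the continuity of $\sigma(\cdot, f)$ when $f \in C_0(X_2; \mathcal{V}_2)$. The same for general $f$ follows from the estimate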




$\|\sigma(\cdot, f)\|_\infty \le \|h\|_2 \|f\|$ for $f \in \mathcal{W}_2$ and from the fact that $C_0(X_2; \mathcal{V}_2)$ is dense in $\mathcal{W}_2$.
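
The estimate invoked at the end of the proof is an instance of the Cauchy-Schwarz inequality. The following sketch assumes hypothesis (1) of Lemma 7 in the form $\|\sigma(x, y)\| \le h(x - y)$ with $h \in L_2(R^n)$, and uses the translation and reflection invariance of Lebesgue measure.

```latex
% Sketch: pointwise bound on sigma(x, f) for f in L_2.
\begin{align*}
\|\sigma(x, f)\|
  &\le \int_{X_2} \|\sigma(x, y)\| \, \|f(y)\| \, dy
   \le \int_{X_2} h(x - y) \, \|f(y)\| \, dy \\
  &\le \Bigl( \int_{R^n} h(x - y)^2 \, dy \Bigr)^{1/2}
       \Bigl( \int_{X_2} \|f(y)\|^2 \, dy \Bigr)^{1/2}
   = \|h\|_2 \, \|f\| .
\end{align*}
% The bound is uniform in x, so uniform limits of continuous sigma(., f_j) stay continuous.
```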


Remark. If $\sigma$ is a kernel of the above type, so is $\sigma^*$ (with interchange of $X_1$ and $X_2$, etc.).

We shall apply this lemma in proving a result on analytic elliptic linear differential operators with coefficients in $\mathcal{L}(\mathcal{V}; \mathcal{V})$, $\mathcal{V}$ a finite dimensional Hilbert space. Let $J_+$ denote the non-negative integers. If $\xi = (\xi_1, \dots, \xi_n) \in R^n$ and $s = (s_1, \dots, s_n) \in J_+^n$, we put $\xi^s = \xi_1^{s_1} \cdots \xi_n^{s_n}$. Moreover, if $|s| = \sum s_i$, then $\partial^{|s|}/\partial x^s$ will denote the operator $\partial^{|s|}/\partial x_1^{s_1} \cdots \partial x_n^{s_n}$ on all complex valued $C^{|s|}$ functions defined on open subsets of $R^n$. Let $O$ be an open subset of $R^n$ and let $m \in J_+$. Let $A_s(\cdot) \in C^\omega(O; \mathcal{L}(\mathcal{V}; \mathcal{V}))$, $|s| \le m$. The analytic linear differential operator $L = \sum_{|s| \le m} A_s(\cdot)\, \partial^{|s|}/\partial x^s$ in $O$ of order $m$ will be called elliptic if, for every $x \in O$, $Q(x; \xi) = \sum_{|s| = m} A_s(x)\, \xi^s$ is singular only if $\xi = 0$. Although this definition of ellipticity is more restrictive than that given in [8], it is independent of basis in $\mathcal{V}$ and suffices for our purposes.
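
To make the definition concrete, here is one operator that satisfies it and one that does not. Both examples are illustrative assumptions rather than part of the source: constant coefficients on $R^n$, with $I$ the identity on the coefficient space.

```latex
% Elliptic: a power of 1 minus the Euclidean Laplacian, of order m = 2k.
\begin{align*}
L &= \Bigl(1 - \sum_{i=1}^{n} \frac{\partial^2}{\partial x_i^2}\Bigr)^{k} I,
\qquad
Q(x; \xi) = (-1)^k \Bigl(\sum_{i=1}^{n} \xi_i^2\Bigr)^{k} I,
\end{align*}
% Q(x; xi) is singular only if xi = 0, so L is elliptic.
% Not elliptic: the wave operator on R^2.
\begin{align*}
L' &= \Bigl(\frac{\partial^2}{\partial x_1^2} - \frac{\partial^2}{\partial x_2^2}\Bigr) I,
\qquad
Q'(x; \xi) = (\xi_1^2 - \xi_2^2)\, I = 0 \quad \text{at } \xi = (1, 1) \neq 0 .
\end{align*}
```

With $n$ fixed, the first example meets the order condition $m > n/2$ of Proposition 3 as soon as $k > n/4$.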


PROPOSITION 3. Let $L$ be an analytic elliptic linear differential operator of order $m$ with coefficients in $\mathcal{L}(\mathcal{V}; \mathcal{V})$ and defined in the open set $O$ of $R^n$. Suppose $m > n/2$. Let $V$ be an isometry of $L_2(O; \mathcal{V})$ into a Hilbert space $\mathcal{K}$. Let $T$ be a densely defined operator in $\mathcal{K}$ such that $V C_0^\infty(O; \mathcal{V}) \subset \mathrm{dom}(T)$ and $TVf = VLf$ for $f \in C_0^\infty(O; \mathcal{V})$. Then: (1) $V^*(\mathrm{dom}(T^*)) \subset C(O; \mathcal{V})$; (2) for every compact subset $K$ of $O$ there is a constant $C_K$ such that $\|V^*v\|_K \le C_K(\|V^*T^*v\| + \|V^*v\|)$ for all $v \in \mathrm{dom}(T^*)$.


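Proof. We follow closely the proof of Lemma 2.1 in [6] (cf. [15], Theorem 1). Let $x_0 \in O$. Let $O_1$ be a bounded open neighborhood of $x_0$ such that $L$ has a fundamental solution $H$ defined on $O_1 \times O_1$ with the properties (see [8], Chapter III): (a) $H \in C^\infty(D; \mathcal{L}(\mathcal{V}; \mathcal{V}))$, where

$$D = [(x, y) \in O_1 \times O_1 : x \ne y];$$

(b) $\|H(x, y)\| = O(\|x - y\|^{m-n-\varepsilon})$ for all $\varepsilon > 0$, where $\|x\|$ is the Euclidean norm of $x$ in $R^n$; (c) for all $y \in O_1$, $LH(\cdot, y) = 0$ on $O_1$ except at $y$; (d) $g = L \int H(\cdot, y) g(y)\, dy$ for all $g \in C_0^\infty(O_1; \mathcal{V})$. From [8], Chapter VII, we know that $\int H(\cdot, y) g(y)\, dy \in C^\infty(O_1; \mathcal{V})$ for all $g \in C_0^\infty(O_1; \mathcal{V})$.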


Choose $k \in C_0^\infty(O_1)$ such that $k = 1$ in an open neighborhood $O_2$ of $x_0$. Exactly as in [6], we see that

$$g = L \int k(\cdot) H(\cdot, y) g(y)\, dy + \int L_x[(1 - k(x)) H(x, y)]\, g(y)\, dy$$

on $O_1$ for all $g \in C_0^\infty(O_2; \mathcal{V})$.

For $(x, y) \in O \times O_2$ set $\xi(x, y) = k(x) H(x, y)$ or 0 and set $\eta(x, y) = L_x[(1 - k(x)) H(x, y)]$ or 0, according as $x \in O_1$ or not. Properties (a) and (c) of $H$ imply that $\eta$ is a bounded continuous function on $O \times O_2$ and so is a kernel of the type considered in Lemma 7. So is $\xi$ by virtue of properties (a) and (b) of $H$ and the choice of $m$. Note also that $\xi(\cdot, g) \in C_0^\infty(O; \mathcal{V})$ whenever $g \in C_0^\infty(O_2; \mathcal{V})$.

Let $v \in \mathrm{dom}(T^*)$. Then, for all $g \in C_0^\infty(O_2; \mathcal{V})$, we have

$$(V^*v, g) = (v, Vg) = (v, VL\xi(\cdot, g)) + (v, V\eta(\cdot, g)).$$

The first term $= (v, TV\xi(\cdot, g)) = (V^*T^*v, \xi(\cdot, g)) = (\xi^*(\cdot, V^*T^*v), g)$, while the second term $= (\eta^*(\cdot, V^*v), g)$. We conclude that $(V^*v)(x) = \xi^*(x, V^*T^*v) + \eta^*(x, V^*v)$ for almost all $x \in O_2$. From Lemma 7 and the remark following it we conclude the truth of our assertions in $O_2$. The general statements are immediate because of the arbitrariness of $x_0$.
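
The splitting of $g$ on which this proof rests can be traced in two lines. This is a sketch under stated assumptions: $u$ is a symbol introduced here for the potential of $g$, the kernel $\eta(x, y)$ is read as $L_x[(1 - k(x)) H(x, y)]$, and differentiation under the integral sign away from the diagonal is taken for granted as in [8].

```latex
% Sketch: cut the potential u with k and 1 - k, then apply L.
\begin{align*}
u &= \int H(\cdot, y)\, g(y)\, dy, \qquad Lu = g \quad \text{by property (d)}, \\
g &= L(ku) + L\bigl((1 - k)u\bigr)
   = L\,\xi(\cdot, g) + \int L_x\bigl[(1 - k(x)) H(x, y)\bigr] g(y)\, dy
   = L\,\xi(\cdot, g) + \eta(\cdot, g).
\end{align*}
% For y in O_2 the factor 1 - k(x) vanishes near x = y, so eta has no singularity on the diagonal;
% by property (c) only derivatives of k survive in eta, which is why eta is bounded and continuous.
```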


We now apply Proposition 3 to prove the key result needed for our intertwining number theorems. A member of $\mathcal{E}$ will be called elliptic if it is elliptic regarded as a left invariant (analytic) linear differential operator. We use the notation of Sections 2 and 3.


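THEOREM 2. Suppose $\dim \mathcal{V} < \infty$. Then $\mathcal{F}^L_\infty \subset C(G; \mathcal{V})$. Suppose, moreover, that $X_0$ is an elliptic element of $\mathcal{E}$ of order $m > n/2$, where $n = \dim M$. Then for every compact subset $K$ of $G$ there is a constant $C_K$ such that $\|g\|_K \le C_K(\|\partial U^L(X_0)g\| + \|g\|)$ for all $g \in \mathcal{F}^L_\infty$.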


Proof. Clearly there is nothing to prove if $n = 0$ (in fact, in this case, the assumptions of ellipticity and order are unnecessary: we may take $X_0 = 0$).
So assume $n \ge 1$. Let $p_0 \in M$ and let $\phi$ be a $C^\omega$ diffeomorphism from the open
unit sphere O, of R” to an open neighborhood of p, in M over which there is 
defined a cross-section « into G. Set B=aod and ton. The map 
of O, onto ¥*(O,) defined by w(é, p) = €B(p) is a diffeomorphism. 
Let » be right Haar measure on @ and set yyw, a Radon measure in 
rx 0O,. Let A be the Cartesian product of right Haar measure in T and 
Lebesgue measure in O,. If y is the density of v with respect to A, clearly 
For &€T and (é,p)€T X0,, set p) = p). 




Then it is clear that v(é,S) =A(é,)v(S) and A(é,8) —8(é)A(S) for all 
€,€ and all Borel SCT X O,. It follows that y(&é, p)8(&) =y(é, p)A(&) 
for é,é,€ T and p€ 0,, so that y(e, p) =8(é)A(é) *y(€, p) for (é,p) €T X 

Now let O = (4) 0, and choose h € so that rh = 1 on $(0). 
If fe L.(0;V), define f on G by 


f(x) =y(e, w(x) (y (x) (28 (2) )*) (Y(2)) 
or 0 according as x€ y*(O) or not. Clearly f satisfies properties (1) and (2) 
in the definition of ¥* (Section 2). Moreover 


by the choice of h; and this, by the Fubini theorem and the definitions of y 


and f, is 


f(x) ||?h(2) de. 


G 


f, | 7(€B(p)) )dv(& p) = 


This shows that || /(-)||? is integrable on compact subsets of [x: h(x) > 0] 
and therefore of h(x) >0] > [2: (th) > 4] D f(z) KO]. 
Therefore is locally integrable on G and || || —||fl2 Setting Vf for 
fé L.(0;V), we see that V is an isometry of L2(0;V) into #. A similar 
calculation shows that (V*g)(-) =A(e,-)4g(B(-)), a.e. in O, for gé F. 
It is clear that V maps C,.*(O;V) onto the subspace of #,* consisting of 
functions whose supports Cy4(O), a subspace C dom(@U4(X)) and in- 
variant under 0U“(X) for all X€ &. 

If f is a complex valued C’ function defined on an open subset of T X 0, 
we set wf =fow*. X,* is elliptic. The analytic elliptic linear differential 
operator we write as Lo= in some 


|r|+|s|=m 


neighborhood of {e} X O adapted to the Cartesian decomposition of I X 0. 
Then, if f€ Co~(O;V), we have 


(V*0U" Vf) (p) = p)*(Xo'f) (B(P)) 
>» p)4are(e, p) 


Ir|+|s|Sm 


[ (8(€)4A(E) (y *f(p)) 


|s|=m 




where the A,€ The operator A,(-)dl#!/dp* is 
elliptic. In fact, if |s|—m, As(p) =4os(e, p)I. Hence 
— 2 A,(p) (2 dos(e, p)€*)I, 
8|=m 8|=m 


which is singular only if $\xi = 0$ by the ellipticity of $L_0$.

Proposition 3 says: (1) if $f \in \mathcal{F}^L_\infty$, then $V^*f \in C(O; \mathcal{V})$; (2) for any compact subset $K_1$ of $O$ there exists a constant $C'_{K_1}$ such that $\|V^*f\|_{K_1} \le C'_{K_1}(\|V^*\partial U^L(X_0)f\| + \|V^*f\|)$ (we here make use of the fact that $\partial U^L(X_0) \subset \partial U^L(X_0^+)^*$). Thus $f(\beta(\cdot))$ is continuous on $O$ and we conclude that


f(x) =8(2B8 (p(x) )*)4A (2B (2) )*) (7) ) 


is continuous on y+(O). From the arbitrariness of py and the existence of 
elliptic members of € of arbitrarily high order (see, e.g., [15]) we conclude 
the truth of the first assertion of our theorem. Moreover, the second assertion 
holds for any compact K Cy?(O): we need only take Cx = C’ yx). 


LUB[8(xB (w(x) )*) 4A (2B (x) )*) Fy (2) ) 4 |]: K]. 


The second assertion for general $K$ follows easily from this and the arbitrariness of $p_0$.


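5. An intertwining number theorem. Let $\Gamma_1$ and $\Gamma_2$ be closed subgroups of the Lie group $G$ with modular functions $\delta_1$ and $\delta_2$ respectively. Let $L^{(i)}$ be a unitary representation of $\Gamma_i$ on the Hilbert space $\mathcal{V}_i$, $i = 1, 2$. $U^{L^{(i)}}$ operates on $\mathcal{F}^{L^{(i)}}$. We shall assume that $\dim \mathcal{V}_2 < \infty$. For each $A \in \mathcal{R}(U^{L^{(1)}}, U^{L^{(2)}})$ we define a linear map $r_A$ from $C_0^\infty(G)$ to the linear maps of $\mathcal{V}_1$ into $\mathcal{V}_2$ as follows: for each $f \in C_0^\infty(G)$ and $v \in \mathcal{V}_1$, set $r_A(f)v = (A\varepsilon(f, v))(e)$. This definition makes sense: in fact, $\varepsilon(f, v) \in \mathcal{F}^{L^{(1)}}_\infty$, by Lemma 6, which implies that $A\varepsilon(f, v) \in \mathcal{F}^{L^{(2)}}_\infty$, by Lemma 4, so that $A\varepsilon(f, v) \in C(G; \mathcal{V}_2)$ by Theorem 2 and the value of $A\varepsilon(f, v)$ at $e$ is well determined.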
For (&, €2) € T, and for any function f on G, we set =f 
r€G. We may now state our main theorem. 


THEOREM 3. Let $X_0$ be an elliptic element of $\mathcal{E}$ of order $> \tfrac{1}{2}\dim(G/\Gamma_2)$. For $f \in C_0^\infty(G)$, set $\|f\|_{X_0} = \|X_0 f\|_\infty + \|f\|_\infty$. For each relatively compact open set $O$ of $G$, give $C_0^\infty(O)$ the topology induced by $\|\cdot\|_{X_0}$; give $C_0^\infty(G)$ the corresponding inductive limit topology [1]. Let $\mathcal{M}$ = the subspace of maps


ZE L(Co* (4) (Vi; V2) ) such that 


Z ) = 6, 88. (€:6°* (f) LM 




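for all $(\xi_1, \xi_2) \in \Gamma_1 \times \Gamma_2$ and all $f \in C_0^\infty(G)$. Then the map $A \to r_A$ is a faithful linear map of $\mathcal{R}(U^{L^{(1)}}, U^{L^{(2)}})$ into $\mathcal{M}$.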


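Proof. Let $v \in \mathcal{V}_1$, $A \in \mathcal{R}(U^{L^{(1)}}, U^{L^{(2)}})$, and $f \in C_0^\infty(O)$, $O$ a relatively compact open subset of $G$. By Theorem 2,

$$\|r_A(f)v\| \le C_{\{e\}}(\|\partial U^{L^{(2)}}(X_0) A\varepsilon(f, v)\| + \|A\varepsilon(f, v)\|).$$

But $\partial U^{L^{(2)}}(X_0) A\varepsilon(f, v) = A\,\partial U^{L^{(1)}}(X_0)\varepsilon(f, v) = A\varepsilon(X_0 f, v)$ by Lemmas 4 and 6. Therefore

$$\|r_A(f)v\| \le C_{\{e\}}(\|A\varepsilon(X_0 f, v)\| + \|A\varepsilon(f, v)\|) \le C_{\{e\}} \|A\| A_{\bar{O}} \|v\| (\|X_0 f\|_\infty + \|f\|_\infty) = C_{\{e\}} \|A\| A_{\bar{O}} \|v\| \|f\|_{X_0}.$$

This proves that $r_A \in \mathcal{L}(C_0^\infty(G); \mathcal{L}(\mathcal{V}_1; \mathcal{V}_2))$.

Suppose $r_A = 0$. Let $f \in C_0^\infty(G)$ and $v \in \mathcal{V}_1$. For all $x \in G$ we have

$$(A\varepsilon(f, v))(x) = (U^{L^{(2)}}_x A\varepsilon(f, v))(e) = (A U^{L^{(1)}}_x \varepsilon(f, v))(e) = r_A(R_x f)v = 0$$

because $U^{L^{(1)}}_x \circ \varepsilon = \varepsilon \circ (R_x \times I)$. Therefore $A$ is 0 on $\varepsilon(C_0^\infty(G) \times \mathcal{V}_1)$, a total subset of $\mathcal{F}^{L^{(1)}}$ by the remark following Lemma 2.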

To show that let fECo*(G), vE Vi, and (&,é) XT». 
Then 14(p¢,¢f)v = 1a (Pepe, = (Ae(pe, cf, v) (€2) exactly as above, and 
this = 8, (&)4A(é.) (pg, ef) by property (2) for functions of 
On the other hand, 


— 8, Lg,20) (2). 
Therefore r4€ 


If $V$ and $W$ are representations of $G$, $\dim \mathcal{R}(V, W)$ is called the intertwining number of $V$ and $W$ and is denoted by $I(V, W)$ ([10]). Theorem 3 has the following consequence.

6. Inducing from compact subgroups. For notation, terminology, and facts about direct integrals of Hilbert spaces, we refer the reader to [4]. Let $X$ be a separable locally compact space and let $\mu$ be a positive Radon measure for $X$. Let $t \to \mathcal{H}(t)$ be a field of separable Hilbert spaces and let $G$ be a separable locally compact group. A field of (unitary) representations of $G$ is a map which assigns to each $t \in X$ a representation $U(t)$ of $G$ on $\mathcal{H}(t)$. If $t \to \mathcal{H}(t)$ is measurable, then $t \to U(t)$ is called measurable if $t \to U_x(t)$ is a measurable operator field for each $x \in G$. We know ([10]) that $x \to \int^{\oplus} U_x(t)\, d\mu(t)$ is a representation of $G$ on $\int^{\oplus} \mathcal{H}(t)\, d\mu(t)$, which we denote $\int^{\oplus} U(t)\, d\mu(t)$.

If $V$ and $W$ are representations of $G$, $J(V, W)$ will denote the dimension of the subspace of all Hilbert-Schmidt operators in $\mathcal{R}(V, W)$ and will be called the weak intertwining number of $V$ and $W$ ([10]).


LEMMA 8. Let $(X, \mu)$ be atom free, let $t \to \mathcal{H}(t)$ be a measurable field of Hilbert spaces, and let $t \to U(t)$ be a measurable field of representations of the separable locally compact group $G$. Let $V$ be a representation of $G$ on the separable Hilbert space $\mathcal{K}$. Then the set $Y = [t \in X : J(U(t), V) > 0]$ is measurable. Moreover $J(\int^{\oplus} U(t)\, d\mu(t), V) = 0$ or $\infty$ according as $\mu(Y) = 0$ or $> 0$.
Proof. Let $\mathcal{K}'$ be the conjugate space of $\mathcal{K}$ and let $V'$ be the representation of $G$ in $\mathcal{K}'$ defined by $V'_x = {}^tV_{x^{-1}}$. The field of Hilbert spaces $t \to \mathcal{H}(t) \otimes \mathcal{K}'$ is measurable in a canonical fashion which sets up a unitary equivalence of $\int^{\oplus} (\mathcal{H}(t) \otimes \mathcal{K}')\, d\mu(t)$ with $(\int^{\oplus} \mathcal{H}(t)\, d\mu(t)) \otimes \mathcal{K}'$ ([4], Ch. II, §1, Section 8). The field of representations $t \to U(t) \otimes V'$ is then measurable (ibid., §2, Section 1) and the above unitary equivalence then gives a unitary equivalence of $\int^{\oplus} (U(t) \otimes V')\, d\mu(t)$ with $(\int^{\oplus} U(t)\, d\mu(t)) \otimes V'$ (ibid., §2, Section 6). Now $J(U(t), V)$ = number of times $U(t) \otimes V'$ discretely contains the one-dimensional identity representation of $G$ and similarly for $J(\int^{\oplus} U(t)\, d\mu(t), V)$ ([10], Lemma 8.1). Our lemma now follows from [10], Lemma 13.1.

We are now in a position to prove the main result of this section. We 
return to the situation of Section 5. Two representations are said to be disjoint if no non-zero subrepresentation of one is unitarily equivalent to any subrepresentation of the other.


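THEOREM 4. Suppose that $\Gamma_1$ and $\Gamma_2$ are compact and that $\dim \mathcal{V}_1$ and $\dim \mathcal{V}_2 < \infty$. For each $x \in G$, let ${}_xL^{(2)}$ be the representation of $x\Gamma_2x^{-1}$ defined by $\xi \to L^{(2)}_{x^{-1}\xi x}$, $\xi \in x\Gamma_2x^{-1}$. Suppose that, for locally almost all $x \in G$, the restrictions of $L^{(1)}$ and ${}_xL^{(2)}$ to $\Gamma_1 \cap (x\Gamma_2x^{-1})$ are disjoint. Then $U^{L^{(1)}}$ and $U^{L^{(2)}}$ are disjoint.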


Proof. Let $B$ be a positive definite bilinear form on $\mathfrak{g}$ invariant under the action of $\Gamma_1$ on $\mathfrak{g}$ via the adjoint representation of $G$ on $\mathfrak{g}$. Choose a basis $\{X_i\}$ of $\mathfrak{g}$, orthonormal with respect to $B$. Exactly as in the discussion of the Casimir operator in [14], Exposé n° 4, we see that $\Delta = \sum X_i^2$ is invariant under $\mathrm{ad}\, \Gamma_1$ (where the adjoint representation of $G$ has been extended to $\mathcal{E}$). Moreover $1 - \Delta$ is an elliptic operator on $G$ such that $\int [(1 - \Delta)f]\, \bar{f} \ge \int |f|^2$ for all $f \in C_0^\infty(G)$ (see, for instance, [15]).
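
The lower bound for $1 - \Delta$ can be checked by integrating by parts. This is a sketch, assuming the integrals are taken with respect to right Haar measure, so that each left invariant $X_i$ (which generates right translations) is skew-symmetric on $C_0^\infty(G)$.

```latex
% Sketch: X_i is a real vector field preserving right Haar measure.
\begin{align*}
\int_G (X_i g)\, \bar{f} &= -\int_G g\, \overline{X_i f}
  \qquad (f, g \in C_0^\infty(G)), \\
\int_G [(1 - \Delta) f]\, \bar{f}
  &= \int_G |f|^2 - \sum_i \int_G (X_i^2 f)\, \bar{f}
   = \int_G |f|^2 + \sum_i \int_G |X_i f|^2
   \;\ge\; \int_G |f|^2 .
\end{align*}
```

The same computation applied to powers of $1 - \Delta$ is what makes the operators built from it symmetric and bounded below by the identity.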


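Now suppose $U^{L^{(1)}}$ and $U^{L^{(2)}}$ are not disjoint. Then $I(U^{L^{(1)}}, U^{L^{(2)}}) > 0$. Let $k$ be an integer $> (\tfrac{1}{4})\dim(G/\Gamma_2)$ and set $X_0 = (1 - \Delta)^k$. Using $X_0$ to define $\mathcal{M}$, we see from the Corollary to Theorem 3 that $\dim \mathcal{M} > 0$. The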
map (2, ) > of G X T.) into G gives the pair (G,T; X 
the structure of compact analytic transformation group. Hence there is a 
Z€ M and a relatively compact open set O invariant under the action of 
Yr, XT, such that Z|C,)*(O) 40. Let be an integer > + (4)dim(G) 
and let S be the linear operator in L,(O) with domain C,*(O) defined by 
Sf=(1—A)’f, f€ (0). SZI on Cyo*(O) and so is univalent. We 
define a representation P of T, XT. on L.(O) by setting Pe, ¢f 
f€L.(0). Now Acp=poA because A is left-invariant and is invariant 
under ad T,. Therefore SPz,¢,—= Pe,¢S so that S-*P¢,¢,—= for all 
(6) €T, XT, Setting we have a mapping T from a sub- 
manifold of L.(0) to such that TPe¢,¢f = for all 
f€dom(T). If we identify £(VU.;V.) with V.®@V,’, we therefore obtain 
the relation TP¢,¢, = (&,&) XT2, where Ve,g,—= Lg, L%¢, 
(see [10], §5). Let 8; = (1—A)?|C,*(G@) and (1—A)!*| C,*(G), 
operators which are symmetric and =J in L,(@). Applying Proposition 3 
to S, and 82, we obtain constants C, and C, such that || f |g C, || S,f || and 
|| Sof || for f€ Co" (G@). Since 8S, on C,* (O), it follows 
that ||flx=|| Xof (C1 +C2)|| for all feC.-(O). We 
conclude that 7 is continuous. Therefore, since dom(7') is invariant under 
P, the unique bounded extension 7° of T to L.(O) which vanishes on dom(T7')+ 
belongs to R(P,V) >0, and I(P,V)>0. Since dim(V.®@ V2’) <0, it 
follows that J(P,V) >0. 


$O$, being open and relatively compact, is a separable locally compact space. Let $\mathfrak{X}$ be the space of orbits of $O$ under the action of $\Gamma_1 \times \Gamma_2$. $\mathfrak{X}$ is also a separable locally compact space in the usual topology. We put a Radon measure $\nu_t$ on each (compact separable space) $t \in \mathfrak{X}$ as follows: pick

a point 2» € ¢ and then set f f(a) = és) for all 
f¢€ C(t), where Haar measure on T; X I, is normalized so that f 1d(&1, 2) 


$= 1$. The two-sided invariance of Haar measure on compact groups shows that $\nu_t$ is independent of the choice of $x_0 \in t$. We also define a measure $\mu$ on


% by setting f(t) dp(t) (fom) (x)dx for each f€ C,(X), where 
x 

7 is the usual projection of O on %. Let H(t) —L.(t,%), a separable 

Hilbert space for each t€ 96. Each f€ C,(O) defines a vector field 6f, where 

(Of) (t) =f|t. If f,géC,(O), then 


((Af)~(x)), (89) (w(a)) ff (Gres) 9 (Er 


whence ¢—> ((@f) (¢), (6g) (¢)) is continuous on 26. Moreover, if {fn} is a 
multiplicatively closed sequence of real functions in C)(O) which separates 
the points of O, the Stone-Weierstrass theorem implies that the sequence 
{(@fn) (¢)} is total in M(t) for all x€ %. Thus the field of Hilbert spaces 
t— &#(t) has a unique measurable structure such that all vector fields in 
G)=6C,(O) are measurable. Now 


f |? da = | f(x) |? dx = | f |l’, 


for all f€ C.(O), by the Fubini theorem and the invariance properties of 
® 
Haar measure on G. Therefore we may identify Z.(O) and f H(t) dy(t). 


For each ¢€ 2%, we define a representation P(t) of T,; XT, on M(t) by 
setting Pe, ¢,(t)f =pe,ef for fe M(t), (&,6) ET, Since 


Pe,g,(t) (Of) (t) = (OP (¢) 


for each (€,é) €T, KX T2, f€ Co(G@), and t€ 9, we see that the field of 
representations ¢—> P(t) is measurable and that P may be identified with 


$\int^{\oplus} P(t)\, d\mu(t)$. From Lemma 8 and the fact that $V$ is of finite degree we conclude that the $\mu$-measurable set $Y = [t \in \mathfrak{X} : I(P(t), V) > 0]$ has positive $\mu$-measure.

Fix t¢€ % and pick zx, €¢. The stationary subgroup of z, under the 
action of XT. is too): EE TL N ]. Tf fe H(t), 
define x:f on by (Xf) &) =f It is easy to see that 
X; identifies P(t) with the representation of T, XT, induced by the one- 
dimensional identity representation of T,,. Therefore, according to the 
Frobenius reciprocity theorem for compact groups ([{16], pp. 82-83), 
I(P(t), V) number of times V restricted to [,, contains the one-dimen- 
sional identity representation. But this is just the weak intertwining number 
of the restrictions of L® and to ([10], Lemma 8.1). 
We conclude that the restrictions of L@ and ,L® to Ty N(aP.a") are not 
disjoint whenever x€ r4(Y), a measurable set of positive measure in the 
compact set O. Our theorem is thereby demonstrated. 


Remark. As usual (cf. [10]), the intertwining number of the restrictions of $L^{(1)}$ and ${}_xL^{(2)}$ to $\Gamma_1 \cap (x\Gamma_2x^{-1})$ depends only on the $\Gamma_1 : \Gamma_2$ double coset $\Gamma_1 x \Gamma_2$ to which $x$ belongs.


7. An example. Let $G$ be the group of (possibly improper) rigid motions of the Euclidean plane $\Pi$, let $P \in \Pi$, and let $\Gamma$ be the group of (possibly improper) rotations of $\Pi$ about $P$. Let $L^{(1)}$ (resp. $L^{(2)}$) be the 1-dimensional identity representation of $\Gamma$ (resp. the 1-dimensional representation of $\Gamma$ which assigns to $\xi \in \Gamma$ the value $+1$ or $-1$ as $\xi$ is proper or improper). If $T$ is the subgroup of translations of $\Pi$, $G = T\Gamma$ shows that each $\Gamma : \Gamma$ double coset of $G$ contains a member of $T$. Let $e \ne x \in T$. Then $x\Gamma x^{-1}$ is the group of (possibly improper) rotations about $xP$, so that $\Gamma \cap (x\Gamma x^{-1})$ is the group consisting of $e$ and reflection in the line through $P$ and $xP$. Therefore the restrictions of $L^{(1)}$ and ${}_xL^{(2)}$ to $\Gamma \cap (x\Gamma x^{-1})$ are disjoint, and by the remark following Theorem 4, this holds for all $x \notin \Gamma$. The same holds if $x \in \Gamma$, for then $\Gamma \cap (x\Gamma x^{-1}) = \Gamma$. From Theorem 4 we conclude that $U^{L^{(1)}}$ and $U^{L^{(2)}}$ are disjoint.
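
The disjointness of the two restrictions comes down to a character computation on a group of order two. A minimal sketch follows; the symbols $r$ (the reflection in the line through $P$ and $xP$) and $\chi_1$, $\chi_2$ (the characters of the two restricted representations) are introduced here for illustration, and the conjugated representation is taken to be $\xi \to L^{(2)}_{x^{-1}\xi x}$.

```latex
% Sketch: both restrictions are one-dimensional, so they are determined by their values at r.
\begin{align*}
\Gamma \cap (x \Gamma x^{-1}) &= \{e, r\}, \qquad
  x^{-1} r x \ \text{is again improper}, \\
\chi_1(e) &= 1, \quad \chi_1(r) = 1
  && (\text{restriction of } L^{(1)}), \\
\chi_2(e) &= 1, \quad \chi_2(r) = L^{(2)}_{x^{-1} r x} = -1
  && (\text{restriction of } {}_xL^{(2)}), \\
\langle \chi_1, \chi_2 \rangle &= \tfrac{1}{2}\bigl(1 \cdot 1 + 1 \cdot (-1)\bigr) = 0 .
\end{align*}
% Orthogonal characters: the two one-dimensional representations are inequivalent, hence disjoint.
```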


We can derive the same result from Mackey's reciprocity theorem. Since $G$ is a regular semi-direct product of the abelian normal subgroup $T$ and $\Gamma$, we may apply the analysis of ([10], § 14) to obtain the following information about the irreducible representations of $G$. Let $\hat{T}$ be the character group of $T$ and let $\Gamma$ act on $\hat{T}$ according to the rule $(\chi\xi)(x) = \chi(\xi x \xi^{-1})$ for $\xi \in \Gamma$, $\chi \in \hat{T}$, $x \in T$. Let $\Omega$ be the space of orbits of $\hat{T}$ under $\Gamma$ and, for each $\omega \in \Omega$, choose $\chi_\omega \in \omega$. Let $\Gamma_\omega$ be the stationary subgroup of $\chi_\omega$ in $\Gamma$. Then each irreducible representation of $G$ is uniquely specified by an $\omega \in \Omega$ and an irreducible representation $M$ of $\Gamma_\omega$, and conversely. Calling this representation $V^{\omega, M}$, the restriction of $V^{\omega, M}$ to $\Gamma$ is the representation ${}_\Gamma U^M$ of $\Gamma$ induced by $M$. Now if $\omega = \{e\}$, then $\Gamma_\omega = \Gamma$; and if $\omega \ne \{e\}$, then $\Gamma_\omega$ is the subgroup consisting of the identity and the unique reflection in $\Gamma$ leaving $\chi_\omega$ fixed. Moreover if $M$ is an irreducible representation of $\Gamma_\omega$, we have that ${}_\Gamma U^M$ contains $L^{(i)}$ exactly as many times as the restriction of $L^{(i)}$ to $\Gamma_\omega$ contains $M$ (the Frobenius reciprocity theorem for compact groups). Thus in any case ${}_\Gamma U^M$ contains $L^{(1)}$ if and only if it does not contain $L^{(2)}$. Finally from results of Godement ([7], Theorems 5 and 7) and Kaplansky ([9], Theorem 7), we know that the regular representation of $G$ is of Type I. Therefore Mackey's reciprocity theorem ([11], Theorem 5.1) applies, and we deduce that $U^{L^{(1)}}$ and $U^{L^{(2)}}$ are disjoint.

It is instructive to note that in order to apply Mackey's reciprocity theorems to verify the conclusion of our Theorem 4 in any particular case one needs to know (1) how the regular representation of $G$ decomposes into factors and (2) how these factors, when restricted to $\Gamma_1$ and $\Gamma_2$, decompose into irreducible representations. Even in so simple a case as the motion group considered above, the machinery needed to dig out these facts is quite formidable. Thus the reciprocity theorem would not seem well suited in general to handle the disjointness question dealt with in Theorem 4.


UNIVERSITY OF CALIFORNIA, 
Los ANGELES. 


REFERENCES. 


[1] N. Bourbaki, Espaces vectoriels topologiques, Chapters I-II, Hermann, Paris, 1953.

[2] N. Bourbaki, Intégration, Chapters I-IV, Hermann, Paris, 1952.

[3] F. Bruhat, "Sur les représentations induites des groupes de Lie," Bulletin de la Société Mathématique de France, vol. 84 (1956), pp. 97-205.

[4] J. Dixmier, Les algèbres d'opérateurs dans l'espace hilbertien, Gauthier-Villars, Paris, 1957.

[5] L. Gårding, "Note on continuous representations of Lie groups," Proceedings of the National Academy of Sciences, vol. 33 (1947), pp. 331-332.

[6] L. Gårding, "Applications of the theory of direct integrals of Hilbert spaces to some integral and differential operators," Institute for Fluid Dynamics and Applied Mathematics Lecture Series, No. 11, University of Maryland.

[7] R. Godement, "A theory of spherical functions I," Transactions of the American Mathematical Society, vol. 73 (1952), pp. 496-556.

[8] F. John, Plane waves and spherical means, Interscience, New York, 1955.

[9] I. Kaplansky, "Group algebras in the large," Tôhoku Mathematical Journal, vol. 3 (1951), pp. 249-256.

[10] G. W. Mackey, "Induced representations of locally compact groups I," Annals of Mathematics, vol. 55 (1952), pp. 101-139.

[11] G. W. Mackey, "Induced representations of locally compact groups II," ibid., vol. 58 (1953), pp. 193-221.

[12] E. Nelson and W. F. Stinespring, "Representation of elliptic operators in an enveloping algebra," American Journal of Mathematics, vol. 81 (1959), pp. 547-560.

[13] I. E. Segal, "A class of operator algebras which are determined by groups," Duke Mathematical Journal, vol. 18 (1951), pp. 221-265.

[14] Séminaire "Sophus Lie" 1954-55, École Normale Supérieure, Paris, 1955.

[15] W. F. Stinespring, "Integrability of Fourier transforms for unimodular Lie groups," Duke Mathematical Journal, vol. 26 (1959), pp. 123-131.

[16] A. Weil, L'intégration dans les groupes topologiques et ses applications, Hermann, Paris, 1940.




ON CHOW VARIETIES OF MAXIMAL, TOTAL, REGULAR 
FAMILIES OF POSITIVE DIVISORS.* * 


By J. P. Murre. 


Introduction. In this paper we study the Chow variety of a maximal, total, regular family of positive divisors on a non-singular projective variety $V$. An algebraic family $\mathfrak{U}$ of positive divisors on $V$ is called maximal if $\mathfrak{U}$ is not a proper subset of another algebraic family; it is called total if for every divisor $Y$ on $V$, algebraically equivalent to zero, and for an arbitrary fixed (i.e. independent of $Y$) $X_0 \in \mathfrak{U}$ there exists an $X \in \mathfrak{U}$ such that $Y \sim X - X_0$ ($\sim$ means linear equivalence); finally, it is called regular if for every pair $X, X' \in \mathfrak{U}$ we have $l(X) = l(X')$, where $l(X)$ denotes the dimension of the linear system determined by $X$. These definitions are introduced, and the existence of such families is proved, in [6,7] (in [6,7] such families are called maximal, regular, complete instead of maximal, total, regular in this order).

If $V$ is embedded in projective space $P^N$, then the Chow points are constructed by means of the hyperplanes in $P^N$, and therefore we must expect that there is some connection between the properties of the Chow variety $U$ of $\mathfrak{U}$ (for instance the non-singularity of $U$) and the way $V$ is embedded in $P^N$ (or to be more precise, the properties of the linear system of hyperplane sections on $V$). Our main purpose is to show that, under a mild condition on the embedding of $V$ in $P^N$, the Chow variety of a maximal, total, regular family is non-singular. As a preparation to this result we first study the Chow varieties of linear systems on $V$ and it turns out that, under the same condition on the embedding of $V$ in $P^N$, the Chow variety of a linear system is non-singular (Proposition 1).

As we have just mentioned we have to assume some properties for the linear system of hyperplane sections on $V$ in order to be able to prove the non-singularity of $U$; if these properties are fulfilled, we shall say that $V$ is adaptably embedded in $P^N$ (see the definition in Section 2). However, this is not a very serious restriction, for we shall see in Section 2 that the embedding


* Received July 25, 1960. 
*This work was supported at Northwestern University by the National Science 
Foundation under project NSF-Gg506. 




$V'$ of $V$ by means of the hypersurface sections of degree $m$ ($m > 1$) in $P^N$ always has these properties (Lemma 5). This result can also be interpreted in the following way. If we construct the Chow points by using hypersurfaces of degree $m$ ($m > 1$) instead of hyperplanes, then the Chow varieties of maximal, total, regular families (and also of linear systems) are non-singular.

In Section 1 we study the degrees of Chow varieties; Lemmas 1 and 2 are generalizations of results of Chow in [8].

I am very grateful to T. Matsusaka for his valuable help and encouragement during the preparation of this paper.


1. All varieties under consideration are assumed to be projective varieties; therefore the degree of a variety is defined. The ambient projective space of a variety is usually denoted by $P^N$ and $Z_0, Z_1, \dots, Z_N$ are used as letters for $P^N$; the ambient projective space of the Chow variety of a system of positive cycles (of a certain dimension and a certain degree) in $P^N$ is usually denoted by $P^t$, and for $P^t$ we use the letters $Y_0, Y_1, \dots, Y_t$. If $X$ is a positive cycle in $P^N$, then its Chow point is denoted by $\mathrm{Ch}(X)$; the degree of a variety $V$ is denoted by $\deg V$.

If $P^N$ is a projective space and if $u_{ij}$ ($j = 0, \dots, N$; $i = 1, \dots, n$) is a set of elements of the universal domain $\Omega$, then we mean by the linear variety defined by the set of elements $(u_{ij})$ (or sometimes shortly $(u)$) the variety defined by the set of equations $\sum_{j=0}^{N} u_{ij} Z_j = 0$ ($i = 1, \dots, n$) in $P^N$.

First, we mention some facts which will be used frequently in the following. Let $V^n$ be a variety in $P^N$ defined over a field $k$ and of degree $h$. Let $(u_{ij\sigma})$ ($j = 0, \dots, N$; $i = 1, \dots, n$; $\sigma = 1, \dots, r$, where $r$ is some integer) be a system of independent transcendental elements over $k$, and let $L_\sigma$ ($\sigma = 1, \dots, r$) be the linear variety defined by the set $(u_{ij\sigma})$ ($\sigma$ fixed). Then $L_\sigma \cdot V = \sum_{\alpha=1}^{h} Q_{\sigma\alpha}$, where all the $Q_{\sigma\alpha}$ are different from each other. If $K_\sigma$ denotes the field obtained by adjoining to $k$ all the $(u_{ij\tau})$ with $\tau \ne \sigma$, then $Q_{\sigma\alpha}$ and $Q_{\sigma\beta}$ ($\beta \ne \alpha$) are independent generic points of $V$ over $K_\sigma$ [5, Chap. VIII, Prop. 10]. Therefore, given two arbitrary sets of indices $(\alpha_1, \dots, \alpha_r)$ and $(\beta_1, \dots, \beta_r)$ with $1 \le \alpha_i, \beta_i \le h$, there exists a $k$-automorphism of the universal domain transforming $Q_{\sigma\alpha_\sigma}$ into $Q_{\sigma\beta_\sigma}$ ($\sigma = 1, \dots, r$). Furthermore, let $U$ be the Chow variety of an algebraic system of positive divisors on $V$. Let $H_\sigma$ be the hyperplane in the ambient projective space $P^t$ of $U$ defined by the equation $\sum_\rho M_\rho(u_{ij,\sigma}) Y_\rho = 0$ (for some fixed $\sigma$), where the $M_\rho(U_{ij})$ range over the monomials of a suitable degree in the $U_{ij}$ (the monomials occurring in the Chow forms of the divisors in the algebraic system associated with $U$). Such a hyperplane will be called a hyperplane derived from the set $(u_{ij,\sigma})$. Then we have by the properties of Chow coordinates (see [1]) for a divisor $X$ in the system that $\mathrm{Ch}(X) \in U \cap H_\sigma$ if and only if $X$ (or better the point set $|X|$) contains some point $Q_{\sigma\alpha}$.

LEMMA 1. Let $V^n$ be a complete variety, non-singular in codimension 1 and of degree $h$. Let $U^r$ be the Chow variety of an algebraic family $\mathfrak{U}$ of positive divisors on $V$. Then $\deg U \ge h^r$.


Proof. Let $k$ be a field of definition for $V$ and $U$. Let $(u_{ij\sigma})$ ($j = 0, \dots, N$; $i = 1, \dots, n$; $\sigma = 1, \dots, r$) be a set of independent transcendental elements over $k$. If $L_\sigma$ ($\sigma = 1, \dots, r$) is the linear variety in the ambient projective space of $V$ defined by the set $(u_{ij\sigma})$ and if $L_\sigma \cdot V = \sum_{\alpha=1}^{h} Q_{\sigma\alpha}$, then we can apply to this intersection the remarks preceding the lemma. Let furthermore $H_\sigma$ ($\sigma = 1, \dots, r$) be the hyperplanes, in the ambient space $P^t$ of $U$, derived from the sets $(u_{ij,\sigma})$.

First of all, we want to show that $U \cdot H_1 \cdots H_r$ is defined. Clearly, it suffices to show that $U \cdot H_1$ is defined. Let $X$ be a divisor in $\mathfrak{U}$ and algebraic over $k$. Since every $Q_{1\alpha}$ is generic over $k$ on $V$, it follows that $X$ contains no point $Q_{1\alpha}$. Therefore (see the remarks preceding the lemma) $\mathrm{Ch}(X) \notin U \cap H_1$, i.e. $U \not\subset H_1$, i.e. $U \cdot H_1$ is defined.

Since $U \cap H_1 \cap \cdots \cap H_r \neq \emptyset$ for dimension reasons and since $V$ is
complete and non-singular in codimension 1, it follows that there exists a
divisor $X$ in $\mathfrak{U}$ and a set of indices $(\alpha_1, \ldots, \alpha_r)$ such that $Q_{\sigma\alpha_\sigma} \in |X|$ for
$\sigma = 1, \ldots, r$; let us assume for convenience that $\alpha_\sigma = 1$, i.e. that $Q_{\sigma 1} \in |X|$.
Given any two sets of indices $(\alpha_1, \ldots, \alpha_r)$ and $(\beta_1, \ldots, \beta_r)$,
we have seen above that there exists a $k$-automorphism of the universal domain
transforming $Q_{\sigma\alpha_\sigma}$ into $Q_{\sigma\beta_\sigma}$. Since there are precisely $h^r$ such sets, it suffices,
in order to prove the lemma, to show that, for some set of indices $(\alpha_1, \ldots, \alpha_r)$,
there exists a divisor $X^*$ of $\mathfrak{U}$ such that $Q_{\sigma\alpha_\sigma} \in |X^*|$ for $\sigma = 1, \ldots, r$ but
$Q_{\sigma\beta} \notin |X^*|$ for all $\beta \neq \alpha_\sigma$ $(\sigma = 1, \ldots, r)$. In order to see that, let $L'_\sigma$
$(\sigma = 1, \ldots, r)$ be a linear variety in $P^N$ defined by a set $(u'_{ij,\sigma})$ which is
such that:

1. $L'_\sigma$ goes through $Q_{\sigma 1}$ $(\sigma = 1, \ldots, r)$,

2. over the field $K = k(Q_{11}, Q_{21}, \ldots, Q_{r1}, \mathrm{Ch}(X))$ the set $(u')$ is a
generic set which fulfills the condition 1.

Since the $Q_{\sigma 1}$ are $r$ independent generic points of $V$ over $k$, we clearly have
over $k$ the generic specialization $(u) \to (u')$, where $(u)$ stands for the entire




102 J. P. MURRE. 


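with, say, $Q'_{\sigma 1} = Q_{\sigma 1}$; from the remarks preceding the lemma it follows that
$Q'_{\sigma\alpha}$ is a generic point of $V$ over $K$ for $\alpha \neq 1$. Extend the generic specialization
$(u) \to (u')$ to a generic specialization $(u', Q', X) \to (u, Q, X^*)$ over $k$,
where the set $(Q)$ contains all the $Q_{\sigma\alpha}$, and $(Q')$ similarly. Let under this
specialization $Q'_{\sigma 1}$ correspond with $Q_{\sigma\gamma_\sigma}$. Now $Q'_{\sigma 1} \in |X|$, but since $Q'_{\sigma\alpha}$ for
$\alpha \neq 1$ is a generic point of $V$ over $K = K(\mathrm{Ch}(X))$, it follows that $Q'_{\sigma\alpha} \notin |X|$
for $\alpha \neq 1$ and $\sigma = 1, \ldots, r$. Therefore it follows that $Q_{\sigma\gamma_\sigma} \in |X^*|$ but
$Q_{\sigma\beta} \notin |X^*|$ for $\beta \neq \gamma_\sigma$. This completes the proof.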


Lemma 2. Let $V^n$ be a complete variety, non-singular in codimension 1
and of degree $h$. Let $U^r$ be the Chow variety of a linear system $\mathfrak{L}$ of divisors
on $V$. Assume that $\mathfrak{L}$ is without fixed component. Then $\deg U = h^r$.


Proof. Let $k$ be a field of definition for $V$ and $U$ and such that the
linear system has a module in the function field¹ which has a basis of functions
defined over $k$. We have to consider two different cases.


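Case 1. Suppose a generic element of $\mathfrak{L}$ has no multiple components.
Let $L_\sigma$ $(\sigma = 1, \ldots, r)$ be the linear varieties in $P^N$ defined by the sets of
elements $(u_{ij,\sigma})$ $(j = 0, \ldots, N;\ i = 1, \ldots, n)$, where all the $u_{ij,\sigma}$ are transcendental
and independent from each other over $k$. Let $L_\sigma \cdot V = \sum_{\alpha=1}^{h} Q_{\sigma\alpha}$; then we
have noted above that $Q_{\sigma\alpha}$ and $Q_{\sigma\beta}$ $(\alpha \neq \beta)$ are two independent generic points
of $V$ over the field $K_\sigma$ (introduced above). Furthermore, for any set of indices
$(\alpha_1, \ldots, \alpha_r)$ with $1 \leq \alpha_\sigma \leq h$ we have that $Q_{1\alpha_1}, \ldots, Q_{r\alpha_r}$ is a set of $r$ independent
generic points of $V$ over $k$, and therefore there is precisely one element
$X$ in the linear system going through $Q_{1\alpha_1}, \ldots, Q_{r\alpha_r}$. If we denote this
by $X_{\alpha_1 \ldots \alpha_r}$, then $X_{\alpha_1 \ldots \alpha_r}$ is rational over the field $K_{\alpha_1 \ldots \alpha_r} = k(Q_{1\alpha_1}, \ldots, Q_{r\alpha_r})$.
Since, as we have seen above, $Q_{\sigma\beta}$ for $\beta \neq \alpha_\sigma$ is a generic point of $V$ over
this field, it follows that $X_{\alpha_1 \ldots \alpha_r}$ does not contain $Q_{\sigma\beta}$. Therefore $X_{\alpha_1 \ldots \alpha_r}
\neq X_{\beta_1 \ldots \beta_r}$ if for some $\sigma$ the $\alpha_\sigma \neq \beta_\sigma$.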

Next, let $H_\sigma$ $(\sigma = 1, \ldots, r)$ be the hyperplane in the ambient projective
space of $U$ derived from the set $(u_{ij,\sigma})$. Then we have, according to the
remarks made above, that $\mathrm{Ch}(X_{\alpha_1 \ldots \alpha_r}) \in U \cap H_1 \cap \cdots \cap H_r$ for every set of
indices $(\alpha_1, \ldots, \alpha_r)$ and moreover these are the only points in this intersection.
Since we obtain in this way precisely $h^r$ (different) points, it suffices,
in order to complete the proof, to see that every point has multiplicity 1 in

¹ Let $\mathfrak{L}$ be a linear system on $V$; let $\Omega$ be the universal domain and $\Omega(V)$ the
function field of $V$. We shall say that a vector space $M \subset \Omega(V)$ over $\Omega$ is a module
for $\mathfrak{L}$ if there exists a (fixed) divisor $Y$ on $V$ such that $(f) + Y \in \mathfrak{L}$ for all functions
$f \in M$ and if conversely for every $X \in \mathfrak{L}$ there exists an $f \in M$ such that $X = (f) + Y$.
Such a module always exists; in particular, if we take an $X_0 \in \mathfrak{L}$, then the set $L(X_0)
= \{f \mid f \in \Omega(V) \text{ such that } (f) = Y - X_0 \text{ with } Y \in \mathfrak{L}\}$ is a module for $\mathfrak{L}$.


FAMILIES OF POSITIVE DIVISORS 103 


this intersection. From the fact that $X_{\alpha_1 \ldots \alpha_r}$ contains the points $Q_{1\alpha_1}, \ldots, Q_{r\alpha_r}$,
which are $r$ independent generic points of $V$ over $k$, it follows easily that the
point $\mathrm{Ch}(X_{\alpha_1 \ldots \alpha_r})$ is generic on $U$ over $k$; therefore it is in particular simple.
Since (as is easily seen) the $X_{\alpha_1 \ldots \alpha_r}$ are conjugate to each other over the field
obtained by adjoining the coefficients $u_{ij,\sigma}$ of the $L_\sigma$ to the ground field $k$,
it suffices to consider only one of the $X_{\alpha_1 \ldots \alpha_r}$, and let us write $X$ instead of
$X_{\alpha_1 \ldots \alpha_r}$. We have to show that $H_1 \cdot \ldots \cdot H_r$ is transversal to the tangent
space to $U$ at $\mathrm{Ch}(X)$. Since the $H_1, \ldots, H_r$ are $r$ independent generic
derived hyperplanes over the ground field $k$, it suffices to show that there are
some $r$ derived hyperplanes $H'_1, \ldots, H'_r$ such that $H'_1 \cdot \ldots \cdot H'_r$ is defined
and transversal with the tangent space to $U$ at $\mathrm{Ch}(X)$ (for then over the
specialization $H_i \to H'_i$ over $k$ there is an $X_{\beta_1 \ldots \beta_r}$ specializing to $X$ and hence
$H_1 \cdot \ldots \cdot H_r$ is transversal to the tangent space to $U$ at $\mathrm{Ch}(X_{\beta_1 \ldots \beta_r})$, which
is sufficient since $X_{\alpha_1 \ldots \alpha_r}$ and $X_{\beta_1 \ldots \beta_r}$ are conjugate). However, now we
contend that the intersection of all the derived hyperplanes through $\mathrm{Ch}(X)$
is $\mathrm{Ch}(X)$ itself, which is clearly sufficient for the existence of $r$ derived hyperplanes
$H'_1, \ldots, H'_r$ with the above mentioned property. In order to see that
the contention is true let $\mathrm{Ch}(X) = (c_\lambda)$ and let $(d_\lambda)$ be an arbitrary point
in the ambient space $P^t$ of $U$. Consider the Chow form $\sum_\lambda c_\lambda M_\lambda(U) = F(U)$
and the form $\sum_\lambda d_\lambda M_\lambda(U) = G(U)$. We must try to find a set $(u')$ such that
$F(u') = 0$ and $G(u') \neq 0$. However, if $(c_\lambda) \neq (d_\lambda)$, then the existence of
such a set $(u')$ follows from the fact that $F(U)$ has no multiple factors
since $X$ has no multiple components.


Case 2. Suppose a generic member $X$ (over $k$) of $\mathfrak{L}$ has multiple
components. According to [9, Th. 1.6.3] we have $X = p^e X'$, where $X'$ has
no multiple components and $p$ is the characteristic of the universal domain.
In order not to interrupt the arguments later on we first state an auxiliary
lemma.


Lemma 3. Let $U$ be a projective variety defined over a field $k$ of characteristic
$p$ and let $P$ be a generic point of $U$ over $k$. Let $U^*$ be the locus of
$P^{(p^e)}$ over the field $k^{(p^e)}$, where $P^{(p^e)}$ is the point obtained by raising the coordinates
of $P$ to the $p^e$-th power. Then $\deg U = \deg U^*$.


For the proof of Lemma 3 it suffices to remark that we obtain $U^*$ from
$U$ by applying the Frobenius automorphism $\rho \to \rho^{p^e}$ to the universal domain
and that by this automorphism hyperplanes go over in hyperplanes.

Returning to the proof of Case 2 of Lemma 2, take a fixed divisor $X_0$
in $\mathfrak{L}$ and let $k$ be an algebraically closed field over which $V$ and $U$ are defined,
over which $X_0$ is rational and such that the function module $L(X_0)$ of $\mathfrak{L}$ has
a base of functions defined over $k$. Let $X$ be a generic member of $\mathfrak{L}$ over $k$;
then we have $X = p^e X'$ as we have seen above. Let $U'$ be the locus of the




$\mathrm{Ch}(X')$ over $k$. If $\mathrm{Ch}(Y') \in U'$, then $\mathrm{Ch}(p^e Y') \in U$, and conversely if
$\mathrm{Ch}(Y) \in U$, then $Y = p^e Y'$ with a $Y'$ such that $\mathrm{Ch}(Y') \in U'$; in particular
$X_0 = p^e X_0'$ with $\mathrm{Ch}(X_0') \in U'$. By [9, Prop. 1.6.4] we have $L(X_0)
\subset (\Omega(V))^{p^e}$, where $\Omega$ is the universal domain and $\Omega(V)$ is the function field
of $V$ over $\Omega$. If $\mathrm{Ch}(Y') \in U'$, then $p^e Y' = X_0 + (f)$ with $f \in L(X_0)$ and hence
$Y' - X_0' = (g)$ with $g \in \Omega(V)$, and hence $Y' \sim X_0'$. Moreover, it is
then easily seen that $U'$ is the Chow variety of a linear system $\mathfrak{L}'$ without
fixed components, a generic element $X'$ of which has no multiple components.
Let $P^s$ be the ambient space of $U'$. Let $(c_\nu) = \mathrm{Ch}(X')$ and $(d_\nu) = \mathrm{Ch}(X)$;
then we can rearrange the coordinates $d_\nu$ such that $d_\nu = c_\nu^{p^e}$ for $\nu = 1, \ldots, s$
and $d_\nu = 0$ for $\nu = s+1, \ldots, t$. Therefore there is a projection of the
ambient space $P^t$ of $U$ onto $P^s$ such that the image of $U$ is the variety $U^*$,
where $U^*$ is the locus of $(\mathrm{Ch}(X'))^{(p^e)}$ over $k$ and clearly $\deg U = \deg U^*$.
By Lemma 3 $\deg U^* = \deg U'$ and $\deg U' = h^r$ by Case 1, which completes the
proof.
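
A quick consistency check of Lemma 2 can be made on hyperplane sections. The following sketch is an illustration added here, not part of the proof; it assumes that $V$ is not contained in a hyperplane and that a hyperplane $H$ is determined by the cycle $V \cdot H$, and the names $F_V$, $a$, $c$ and $\Phi$ are hypothetical notation.

```latex
% Assumptions (illustrative): V^n \subset P^N has degree h and spans P^N;
% \mathfrak{L} is the system of hyperplane sections V.H, so that r = N.
% F_V(U_0, ..., U_n) denotes the Chow form of V, homogeneous of degree h in each set U_i.
\[
  H:\ \sum_{j=0}^{N} a_j Z_j = 0
  \quad\Longrightarrow\quad
  F_{V \cdot H}(U_1,\ldots,U_n) \;=\; c \cdot F_V(a,\,U_1,\ldots,U_n), \qquad c \neq 0 .
\]
% Hence the Chow coordinates of V.H are forms of degree h in a = (a_0, ..., a_N):
\[
  \Phi:\ P^N \longrightarrow U \subset P^t, \qquad a \longmapsto \mathrm{Ch}(V \cdot H).
\]
% A hyperplane of P^t pulls back under \Phi to a hypersurface of degree h in P^N,
% and N generic such hypersurfaces meet in h^N points (Bezout), so
\[
  \deg U \;=\; h^{N} \;=\; h^{r}.
\]
```

Under these assumptions the count agrees with the value $h^r$ asserted by the lemma.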

2. Let $V^n$ be a complete, projective variety defined over $k$.

Lemma 4. Let $\mathfrak{L}$ be a linear system of divisors on $V$ having the following
properties:

1. If $P$, $P'$ are any two simple points on $V$, then the linear subsystem
$\mathfrak{L}(P, P')$ of $\mathfrak{L}$, consisting of all divisors of $\mathfrak{L}$ going through $P$ and $P'$, has
no base points (except $P$ and $P'$).

2. If $K \supset k$ is any field such that $\mathfrak{L}$ has an associated function module
with a basis of functions defined over $K$ and if $Y_1, \ldots, Y_n$ are $n$ independent
generic members of $\mathfrak{L}(P, P')$ over $K(P, P')$, then $Y_1 \cdot \ldots \cdot Y_n$ is defined and
equal to $1 \cdot P + 1 \cdot P' + W$, where $W$ is a cycle not containing $P$ and $P'$.

Under these assumptions, if $X_1, \ldots, X_n$ are $n$ independent generic members
of $\mathfrak{L}(P)$ over $K(P)$, then:


a. $X_1 \cdot \ldots \cdot X_n$ is defined and equal to $1 \cdot P + \sum_i 1 \cdot Q_i$;
b. if $i \neq j$, then $Q_i$ and $Q_j$ are independent generic points of $V$ over $K(P)$.


Proof. First, we remark that if $A$ is a subvariety of $V$ algebraic over
$K(P)$ and different from $V$ itself, then $X_1 \cap \cdots \cap X_n \cap A = \emptyset$. For it
follows from 1. that $\mathfrak{L}(P)$ has no fixed points and therefore a generic $X_i$
does not contain $A$, and it follows that the dimension of every component of
$X_1 \cap A$ is smaller than the dimension of $A$. By repeating the argument we
see that $X_1 \cap \cdots \cap X_n \cap A = \emptyset$. Now let $Q \in X_1 \cap \cdots \cap X_n$, $Q \neq P$;
then it follows from the above remark that $Q$ is a simple point on $V$. Next,
let $Y_1, \ldots, Y_n$ be as in 2. for $\mathfrak{L}(P, Q)$; we have $(X_1, \ldots, X_n) \to (Y_1, \ldots, Y_n)$
over $K(P)$. Therefore $X_1 \cdot \ldots \cdot X_n$ is defined, and $P$ and at least one other



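point in $X_1 \cdot \ldots \cdot X_n$ have multiplicity 1. Put $K_1 = K(P)$; let $\varphi_1, \ldots, \varphi_N$
be a base for a function module of $\mathfrak{L}(P)$ such that all $\varphi_i$ are defined over
$K_1$; moreover, we can assume that all $\varphi_i$ are defined at $Q$ and that at least
one $\varphi_i$, say $\varphi_1$, is such that $\varphi_1(Q) \neq 0$ (since $Q$ is no base point for $\mathfrak{L}(P)$).
Let $X_i = (\sum_{\rho=1}^{N} u_{i\rho} \varphi_\rho)_0$, $i = 1, \ldots, n$, with the $u_{i\rho}$ independent transcendentals
over $K_1$. We have $\dim_{K_1} K_1(u, Q) = \dim_{K_1} K_1(u) = Nn$. Since

$$\sum_{\rho=1}^{N} u_{i\rho} \varphi_\rho(Q) = 0 \qquad (i = 1, \ldots, n),$$

we see that $\dim_{K_1(Q)} K_1(Q, u) = (N-1)n$. Then it follows from the tower
$K_1 \subset K_1(Q) \subset K_1(Q, u)$ that

$$\dim_{K_1} K_1(Q) = n \quad \text{and} \quad \dim_{K_1(Q)} K_1(Q, u) = (N-1)n.$$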


Then $X_1, \ldots, X_n$ are $n$ independent generic members of $\mathfrak{L}(P, Q)$ over
$K(P, Q)$; for, the dimension of $\mathfrak{L}(P)$ being $N$ and $Q$ being not a fixed point
of $\mathfrak{L}(P)$, the dimension of $\mathfrak{L}(P, Q)$ is $N - 1$. Then a. follows from 2.
applied to $\mathfrak{L}(P, Q)$. Now let $Q_i$ and $Q_j$ be as in b. From what we just have
seen it follows that $X_1, \ldots, X_n$ are $n$ independent generic members of $\mathfrak{L}(P, Q_i)$ over
$K(P, Q_i)$. If the locus of $Q_j$ over this field has a dimension smaller than $n$,
then, since $\mathfrak{L}(P, Q_i)$ has by assumption no base points (except $P$ and $Q_i$),
it follows by the same argument as in the beginning of this proof that
$X_1 \cap \cdots \cap X_n$ has an empty intersection with every component of the locus
of $Q_j$. This being a contradiction, it follows that $Q_j$ has dimension $n$ over
$K(P, Q_i)$.
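
The dimension count used in the first half of this proof can be written out as follows. This is only a restatement in the notation of the proof ($K_1 = K(P)$, $N$ basis functions, $n = \dim V$); bounding the two partial dimensions by inequalities first, before the tower forces equality, is an assumption about how the argument is meant to run.

```latex
% Q is algebraic over K_1(u), being a point of X_1 \cap ... \cap X_n:
\[ \dim_{K_1} K_1(u,Q) = \dim_{K_1} K_1(u) = Nn . \]
% Q is a point of V (dimension n); the n relations \sum_\rho u_{i\rho}\varphi_\rho(Q) = 0,
% with \varphi_1(Q) \neq 0, express each u_{i1} through the remaining u_{i\rho}:
\[ \dim_{K_1} K_1(Q) \le n, \qquad \dim_{K_1(Q)} K_1(Q,u) \le (N-1)n . \]
% Additivity of dimension in the tower K_1 \subset K_1(Q) \subset K_1(Q,u):
\[ Nn = \dim_{K_1} K_1(Q) + \dim_{K_1(Q)} K_1(Q,u) \le n + (N-1)n = Nn , \]
% so both inequalities are in fact equalities.
```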

Lemma 5. The linear system $\mathfrak{L}_m$ $(m > 1)$ of hypersurface sections of
degree $m$ on $V$ is ample and has the properties 1. and 2. of Lemma 4.


Proof. It is well known that $\mathfrak{L}_m$ is ample. Given $P$ and $P' \in V$ and an
arbitrary point $Q$ in the ambient projective space $P^N$ of $V$, then there clearly
exists a hypersurface of degree $m$ (if $m > 1$) through $P$ and $P'$ but not
through $Q$. As to property 2., let $H_m^{(i)} = L^{(i)} + H_{m-1}^{(i)}$ $(i = 1, \ldots, n)$,
where $L^{(i)}$ is a hyperplane through $P$ and $H_{m-1}^{(i)}$ is a hypersurface of degree
$m - 1$ through $P'$, and moreover we take generic $L^{(i)}$ and $H_{m-1}^{(i)}$ with these
properties and all independent from each other (over a field $K(P, P')$ as in
Lemma 4). Since in particular $L^{(i)}$ does not go through $P'$ and $H_{m-1}^{(i)}$ not
through $P$, we have if we put $V \cdot H_m^{(i)} = X_i'$ that $X_1' \cdot \ldots \cdot X_n'$ has property
2 in Lemma 4, so certainly for generic $X_i$ we have this property.

Definition. A variety $V^n$ is adaptable embedded in projective space $P^N$ if
$V \subset P^N$ and if this embedding has the following property. If $k$ is a field of
definition for $V$ and $P$ a simple point on $V$ rational over $k$, and if $L^{N-n}$ is a
linear variety through $P$ but otherwise generic over $k$, then:




1. $V \cdot L = P + \sum_{i=2}^{h} Q_i$, where the points $P, Q_2, \ldots, Q_h$ are all distinct;


2. for every pair $(i, j)$ with $i \neq j$, the points $Q_i$ and $Q_j$ are independent
generic points of $V$ over $k$.

Lemma 6. Let $V^n$ be a variety in projective space $P^N$. Let $V'$ be the
embedding of $V$ into projective space $P^M$ by means of the hypersurface sections
of degree $m$ $(m > 1)$. Then $V'$ is adaptable embedded in $P^M$.


Proof. Instead of considering $V' \cdot L$ in $P^M$ we can consider in $P^N$ the
intersection $V \cdot H$, where $H$ is a complete intersection of $n$ hypersurfaces of
degree $m$, independent generic from each other over the field $k$ in consideration
(except for going through the given point $P$). The lemma follows then
from Lemma 5 and Lemma 4.


3. Lemma 7. Let $U$ be the Chow variety of an algebraic system of
positive $r$-dimensional cycles in $P^N$, defined over a field $k$ with a generic
element $X$ over $k$. Let $Z$ be a $k$-rational positive $r$-dim. cycle. Let $W$ be the
locus of $\mathrm{Ch}(X + Z)$ over $k$. Then there is an everywhere biregular birational
transformation between $U$ and $W$.
Proof. Let $U$ be in $P^t$. Let $\sum_{\lambda=0}^{t} \xi_\lambda M_\lambda(U)$ be the Chow form of $X$ and
let $F(U) = \sum_{\lambda} \xi^*_\lambda M_\lambda(U)$ be a form with generic $(\xi^*)$ over $k$. Suppose
$\sum_\mu \zeta_\mu N_\mu(U) = G(U)$ is the Chow form of $Z$. Put $F(U) \cdot G(U) = H(U)
= \sum_{\nu=0}^{s} \eta_\nu R_\nu(U)$. Then $W$ is in $P^s$. Furthermore, $\eta_\nu = \rho_\nu(\xi^*)$, where the $\rho_\nu$
are linear forms in the $\xi^*$. These forms define a projective transformation
of $P^t$ into $P^s$; since it follows clearly, from the way the $\eta_\nu$ are defined, that
there is no $(\xi')$ such that all $\rho_\nu(\xi')$ are zero, it follows that $P^t$ is transformed
in a one-to-one manner to a subspace of $P^s$. $W$ is then clearly the projective
transformation of $U$.²


Proposition 1. Let $V$ be a complete variety, non-singular in codimension 1
and adaptable embedded in projective space. Then the Chow variety
of a linear system is non-singular.


Proof. Let $U^r$ be the Chow variety of a linear system $\mathfrak{L}$; let $\deg V^n = h$.
By Lemma 7 we can assume that $\mathfrak{L}$ has no fixed components. Let $X \in \mathfrak{L}$;
since $\mathfrak{L}$ has no fixed components, there is a point $P \in |X|$ which is not a base
point for $\mathfrak{L}$; we can assume that $P$ is simple on $V$. Let $k$ be a field of
definition for $V$ and $U$, such that $X$ and $P$ are rational over $k$ and such that
$\mathfrak{L}$ has a function module with a basis of functions defined over $k$. Let $L^{N-n}$


² It follows from this that the assumption in Lemma 2 that $\mathfrak{L}$ has no fixed components
can be omitted.




be a linear variety in the ambient space $P^N$ of $V$, going through $P$ but
otherwise generic over $k$, defined by a set of elements $(u_{ij})$ $(j = 0, \ldots, N;\
i = 1, \ldots, n)$. Let $V \cdot L = P + \sum_{i=2}^{h} Q_i$ (by assumption on the embedding
$Q_i \neq Q_j \neq P$). Let $H$ be the hyperplane in the ambient space $P^t$ of $U$
derived from the set $(u_{ij})$; $U \cap H$ consists of all divisors in $\mathfrak{L}$ which contain
at least one point of $V \cap L$. Denote by $\mathfrak{L}_P$, resp. $\mathfrak{L}_{Q_i}$ $(i = 2, \ldots, h)$, the
linear subsystems of $\mathfrak{L}$ consisting of all divisors through $P$, resp. $Q_i$. $\mathfrak{L}_P$ is
a proper subsystem of $\mathfrak{L}$ (since $P$ is not a base point) and also the $\mathfrak{L}_{Q_i}$ are
proper; in fact, $X \notin \mathfrak{L}_{Q_i}$ since $Q_i$ is generic on $V$ over $k$ and $X$ is rational
over $k$. Moreover, for $i \neq j$ we have $\mathfrak{L}_{Q_i} \neq \mathfrak{L}_{Q_j}$ since $Q_i$ and $Q_j$ are independent
generic points of $V$ over $k$ (since $V$ is adaptable embedded). Let
$U_P$, $U_i$ $(i = 2, \ldots, h)$ be the Chow varieties of $\mathfrak{L}_P$ and $\mathfrak{L}_{Q_i}$ respectively.
Then the above stated properties can be translated as follows:

$$U \cap H = U_P \cup U_2 \cup \cdots \cup U_h, \qquad \mathrm{Ch}(X) \notin U_i \quad (i = 2, \ldots, h)$$

and $U_i \neq U_j$ for $i \neq j$. By Lemma 2 $\deg U = h^r$; by Lemma 1 $\deg U_P \geq h^{r-1}$
and $\deg U_i \geq h^{r-1}$. Therefore we must have

$$U \cdot H = 1 \cdot U_P + \sum_{i=2}^{h} 1 \cdot U_i.$$

By the criterion of multiplicity 1 [8; VI, Th. 6] the proposition is proved
if $\dim U = 1$, and if $\dim U > 1$, it suffices to prove (since $\mathrm{Ch}(X) \notin U_i$) that
$\mathrm{Ch}(X)$ is simple on $U_P$. Therefore proceeding by induction on $\dim U$ the
proof is complete.
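
The degree count behind the step "Therefore we must have" can be made explicit. What follows is a sketch, assuming only that the components of $U \cap H$ are the varieties $U_P, U_2, \ldots, U_h$ found in the proof; the multiplicities $m_P$ and $m_i$ are symbols introduced here for illustration.

```latex
% A priori the intersection cycle carries unknown positive multiplicities:
\[ U \cdot H = m_P\, U_P + \sum_{i=2}^{h} m_i\, U_i, \qquad m_P,\ m_i \ge 1 . \]
% A hyperplane section has the degree of U, which is h^r by Lemma 2,
% while Lemma 1 bounds each component from below by h^{r-1}:
\[ h^{r} = \deg(U \cdot H) = m_P \deg U_P + \sum_{i=2}^{h} m_i \deg U_i
   \;\ge\; \Big(m_P + \sum_{i=2}^{h} m_i\Big) h^{r-1} \;\ge\; h \cdot h^{r-1} = h^{r}. \]
% Equality throughout forces
\[ m_P = m_2 = \cdots = m_h = 1, \qquad \deg U_P = \deg U_i = h^{r-1}. \]
```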

Proposition 2.³ Let $V^n$ be a complete, non-singular projective variety.
Let $U$ be the Chow variety of a maximal, total, regular family $\mathfrak{U}$ of positive
divisors on $V$. Let $X \in \mathfrak{U}$; then the Chow variety of the complete linear
system $\mathfrak{L}(X)$, determined by $X$, is a simple subvariety of $U$.


Proof. Let $\mathrm{Ch}(X_0)$ be a simple point of $U$ and let $k$ be a common field
of definition for $V$, $U$ and the Pic.$(V)$ over which $X$ and $X_0$ are rational.
Consider the rational mapping $h: U \to$ Pic.$(V)$ defined by $h(\mathrm{Ch}(X^*))
= \mathrm{Class}(X^* - X_0)$, where $X^*$ is a generic element of $\mathfrak{U}$ over $k$; $h$ is defined
over $k$. Let $\Gamma_h$ be the graph of $h$ on $U \times$ Pic.$(V)$. Let $\mathrm{Ch}(X^*)$ be a generic
point of $U$ over $k$, put $\eta^* = \mathrm{Cl.}(X^* - X_0)$ and $\eta = \mathrm{Cl.}(X - X_0)$. If we
denote by $\mathfrak{L}(X^*)$, resp. $\mathfrak{L}(X)$, also the Chow varieties of the complete linear


systems determined by X*, resp. X, then we have by [6, I, Prop. 10 and Cor. | 


³ The writer owes this proposition to T. Matsusaka; it is of special interest for it
follows from this proposition that the Picard variety can be constructed in precisely the
same way as the Jacobian variety is constructed by Chow in [3]. (See the remark
following this proposition.)


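$\Gamma_h \cdot (P^t \times \eta^*) = \mathfrak{L}(X^*) \times \eta^*$ and $\Gamma_h \cap (P^t \times \eta) = \mathfrak{L}(X) \times \eta$. Since $\mathfrak{U}$ is a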




regular family, all the linear systems have the same dimension, and then it
follows by Lemma 2 (and footnote 2) that the corresponding Chow varieties
have the same degree. Therefore we must have $\Gamma_h \cdot (P^t \times \eta) = \mathfrak{L}(X) \times \eta$
(and not a multiple of that cycle since this contradicts the fact that
$\Gamma_h \cdot (P^t \times \eta)$ is the specialization of $\Gamma_h \cdot (P^t \times \eta^*)$ over the specialization
$\eta^* \to \eta$ with respect to $k$). Hence by the criterion of multiplicity 1 [8, VI,
Th. 6] we see that $\mathfrak{L}(X) \times \eta$ is a simple subvariety of $\Gamma_h$. It suffices therefore
to show that the mapping $h$ is regular (in the sense of [8]) at the subvariety
$\mathfrak{L}(X)$ of $U$.

Let $C$ be a generic 1-section of $V$ over $k$, $J$ the Jacobian of $C$ and $\varphi: C \to J$
the canonical mapping. Let $B$ be the abelian subvariety of $J$ generated by
the points $S\varphi((X^* - X_0) \cdot C)$. Then by [4]⁴ $B$ is a model for the Pic.$(V)$
and the mapping $h': U \to B$ defined by $h'(\mathrm{Ch}(X^*)) = S\varphi((X^* - X_0) \cdot C)$
is the canonical mapping; hence we can take Pic.$(V) = B$ and $h = h'$. If
$K \supset k$ is such that $C$, $J$, $\varphi$ are defined over $K$, then we must show that $h'$ is
regular at $\mathrm{Ch}(X')$, where $\mathrm{Ch}(X')$ is a generic point of $\mathfrak{L}(X)$ over $K$. In
view of its application in the next theorem we state the next lemma.


Lemma 8. If $C$ is a generic 1-section of $V$ over $k(\mathrm{Ch}(X'))$ and if $J$, $\varphi$,
$B$ and $h'$ are introduced as above, then $h'$ is regular at $\mathrm{Ch}(X')$.

Proof. Instead of considering $h'$ we can consider $f: U \to J$ defined by
$f(\mathrm{Ch}(X^*)) = S\varphi(X^* \cdot C)$ since $h'$ and $f$ differ only by a constant on $J$. Let
$\deg(X^* \cdot C)$ be $d$. Then we have the following commutative diagram:


$$U \xrightarrow{\;g\;} C(d) \xrightarrow{\;\psi\;} J, \qquad f = \psi \circ g,$$


⁴ Since [4] is still unpublished, we indicate how the proof here can also be obtained
from Chow's results. It is irrelevant for our considerations that Pic.$(V)$ is embedded
in one Jacobian; an embedding into a product of Jacobians is sufficient. By the so-called
regularity theorem [Lang, Abelian Varieties, VIII, Th. 9] there is such an embedding.
However, we must also have a connection between the natural mapping $X \to \mathrm{Cl.}(X - X_0)$
and the intersection of $X$ with the generic 1-sections. In fact, we must have commutativity
in the following diagram (where we can restrict to one Jacobian),

$$U \xrightarrow{\;h\;} \mathrm{Pic.}(V) \xrightarrow{\;\lambda_u\;} J_u, \qquad \gamma = \lambda_u \circ h,$$

where $h$ is the natural mapping $X \to \mathrm{Cl.}(X - X_0)$, $\lambda_u$ is the canonical mapping of the
$k(u)/k$-trace Pic.$(V)$ to $J_u$, and $\gamma(\mathrm{Ch}(X)) = S\varphi((X - X_0) \cdot C_u)$. This commutativity
follows essentially from Th. 12 and Th. 4 in Lang's book, Chap. VIII.




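where $C(d)$ is the Chow variety of positive divisors of degree $d$ on $C$, $g$ is the
mapping $g(\mathrm{Ch}(X^*)) = \mathrm{Ch}(X^* \cdot C)$ and $\psi$ is the mapping $\psi(\sum_{j=1}^{d} P_j) = S\varphi(P_j)$,
where $P_j$ are points on $C$. Since $C(d)$ is non-singular,⁵ it suffices to prove that
$g$ is regular at $\mathrm{Ch}(X')$. Let the Chow form of $X'$ be $\sum_\lambda \xi_\lambda M_\lambda(U)$ where the
$M_\lambda(U)$ are monomials in the letters $U_{ij}$ $(i = 0, \ldots, n-1;\ j = 0, \ldots, N)$. Let
$C$ be the intersection of $V$ with a linear space $L$ defined by $\sum_{j=0}^{N} v_{ij} Z_j = 0$
$(i = 1, \ldots, n-1)$ with $v_{ij}$ independent transcendentals over $k(\mathrm{Ch}(X'))$.
Putting $L \cdot X' = \sum_{\alpha=1}^{d} P_\alpha$ with $P_\alpha = (p_{\alpha 0}, \ldots, p_{\alpha N})$ we have by the definitions
of the Chow forms of $X'$ and $X' \cdot L$ the relation

$$\sum_\lambda \xi_\lambda M_\lambda(U_0, v_1, \ldots, v_{n-1}) = \rho(v) \prod_{\alpha=1}^{d} \Big( \sum_{j=0}^{N} p_{\alpha j} U_{0j} \Big) = \rho(v) \sum_\nu \eta_\nu N_\nu(U_0),$$

where $\rho(v)$ is a rational function in the $v$'s and $\sum_\nu \eta_\nu N_\nu(U_0)$ is the Chow form
of $X' \cdot L$ (in the letters $U_{0j}$). Therefore the $\eta_\nu$ are polynomials in the $\xi_\lambda$ (with
coefficients in $k(v)$) and since, of course, not all these polynomials are zero,
the mapping $g$ is regular at $\mathrm{Ch}(X')$.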


Remark. It follows from Proposition 2 that the variety $W$ of [6, II,
page 59] is itself a model for the Pic.$(V)$. It follows from the properties
stated there that it suffices, in order to show this, that $W$ is non-singular.
$W$ is the Chow variety of the family of linear systems $\mathfrak{L}(X)$ on $U$. Since $\mathfrak{U}$
is regular, every linear system has the same dimension and therefore by
Lemma 2 the same degree. Therefore we have an involutional system in the
sense of [2]. By a theorem of Chow [2, p. 258] it suffices to show that each
$\mathfrak{L}(X)$ is a simple subvariety of $U$, but this is precisely the assertion of
Proposition 2.⁶


THEOREM. Let $V$ be a complete, non-singular variety, adaptable embedded
in projective space. Then the Chow variety of a maximal, total, regular family
of positive divisors on $V$ is non-singular.


Proof. Let $U$ be the Chow variety under consideration; let $\mathrm{Ch}(X) \in U$.
Let $\mathrm{Ch}(X_0)$ be a simple point on $U$. Consider as in the proof of Proposition 2

⁵ There is an oversight on page 472 of [3] in the proof of the non-singularity of
$C(d)$ (in case the divisor has multiple components). However, by using the same arguments
as in the proof of Proposition 1 this can be corrected; the essential point being
that we know the degree of the Chow variety $C(d)$ in terms of the degree of $C$.

⁶ The method of Matsusaka for constructing the Picard variety can also be used to
construct the so-called "generalized Picard varieties" in the sense of Tate (see S. Lang,
Abelian Varieties, page 176). We hope to return to this question on some future
occasion.




the mapping $h: U \to$ Pic.$(V)$ defined by $h(\mathrm{Ch}(X^*)) = \mathrm{Cl.}(X^* - X_0)$, where
$\mathrm{Ch}(X^*)$ is a generic point of $U$ (over a field $k$ of definition for $V$ and $U$ over
which $X$ and $X_0$ are rational). Let $\eta = \mathrm{Cl.}(X - X_0)$; then, if $\Gamma_h$ is the graph
of $h$, we have seen that $\Gamma_h \cdot (P^t \times \eta) = \mathfrak{L}(X) \times \eta$ ($P^t$ is, as usual, the ambient
space of $U$, $\mathfrak{L}(X)$ is the Chow variety of the linear system determined by $X$).
First, we want to show that $\mathrm{Ch}(X) \times \eta$ is simple on $\Gamma_h$. By [8, VI, Th. 6]
it suffices to show that $\mathrm{Ch}(X) \times \eta$ is simple on $\mathfrak{L}(X) \times \eta$. This follows from
the fact that $\mathfrak{L}(X)$ is non-singular by Proposition 1. Therefore it suffices
to show that $h$ is regular at $\mathrm{Ch}(X)$. Introducing a generic 1-section $C$ of $V$
over $k$ and $J$ and $B$ and $h'$ as above we can take by [4] Pic.$(V) = B$ and $h = h'$.
Then $h'$ is regular at $\mathrm{Ch}(X)$ by Lemma 8.


NORTHWESTERN UNIVERSITY AND 
STATE UNIVERSITY, LEIDEN (NETHERLANDS). 


REFERENCES. 


[1] W. L. Chow and B. L. van der Waerden, "Zur algebraischen Geometrie IX,"
Mathematische Annalen, vol. 113 (1937), pp. 692-704.

[2] W. L. Chow, "Algebraic systems of positive cycles in an algebraic variety,"
American Journal of Mathematics, vol. 72 (1950), pp. 247-283.

[3] W. L. Chow, "The Jacobian variety of an algebraic curve," ibid., vol. 76 (1954),
pp. 453-476.

[4] W. L. Hoyt, Unpublished (to appear soon).

[5] S. Lang, Introduction to Algebraic Geometry, Interscience, New York, 1958.

[6] T. Matsusaka, "On the algebraic construction of the Picard variety I and II,"
Japanese Journal of Mathematics, vol. XXI (1951), and vol. XXII (1952),
pp. 217-235 resp. pp. 51-62.

[7] T. Matsusaka, "On algebraic families of positive divisors and their associated
varieties," Journal of the Mathematical Society of Japan, vol. 5 (1953), pp. 113-136.

[8] A. Weil, Foundations of Algebraic Geometry, American Mathematical Society
Colloquium Publications, vol. 24, New York, 1946.

[9] O. Zariski, Introduction to the problem of minimal models in the theory of
algebraic surfaces, Publications of the Mathematical Society of Japan, vol. 4
(1958).


ON THE ALGEBRA OF REPRESENTATIVE FUNCTIONS 
OF AN ANALYTIC GROUP.* 


By G. Hochschild and G. D. Mostow.


1. Introduction. Let $G$ be a real or complex analytic group. If $\rho$ is
an analytic finite dimensional representation of $G$, and if $\varphi$ is a linear functional
on the algebra of all linear endomorphisms of the representation space
of $\rho$, then the composite $\varphi \circ \rho$ is called a representative function on $G$ associated
with $\rho$. Throughout, $R$, or $R(G)$, if the group is to be exhibited, denotes the
algebra of all complex valued representative functions on $G$. In the analysis
of $R$, a special role is played by the group $\mathrm{Hom}(G, C)$ of all analytic homomorphisms
of $G$ into the additive group $C$ of the complex numbers. By
composition with the exponential map of $C$ into the multiplicative group $C^*$
of the non-zero complex numbers, we obtain the subgroup $Q = \exp(\mathrm{Hom}(G, C))$
of the group $\mathrm{Hom}(G, C^*)$ of all analytic homomorphisms of $G$ into $C^*$. It is
a fundamental feature of the generalized Tannaka Theorem (see [2], [3], [4])
that the departure of the representation theory of $G$ from that of an algebraic
linear group depends entirely on the non-triviality of $Q$. It is for this reason
that $Q$ is singled out in a natural way in the structure of $R$.

A subalgebra B of R will be called a basic subalgebra if it satisfies the 
following conditions: (1) B contains the constants, (2) B[Q] = R, (3) the
elements of Q are free over B, (4) in the real case, B is stable under the 
complex conjugation of R. It is known from [3] and [4] that there always 
exists a finitely generated basic subalgebra. However, this result is inadequate 
in as much as it ignores the G-module structure of R. Actually, we shall 
show in Section 3 that there always exists a basic subalgebra that is stable 
under the left G-translations and, in fact, has the additional stability property 
that its ‘semisimple part’ is stable under both the left and the right G-trans- 
lations. Although it is easy to see (Section 3) that all basic subalgebras 
are isomorphic as algebras, they may differ radically in their behaviour under 
the action of G. On the other hand, we shall show in Section 4 that the 
normal basic subalgebras, i.e., the basic algebras that are left stable and whose 
semisimple part is two-sidedly stable, can be classified into orbits under the 
right G-translations, which correspond in a natural 1-1 fashion to the con- 


* Received August 10, 1960. 




112 G. HOCHSCHILD AND G. D. MOSTOW. 


jugacy classes of the decompositions of G into semidirect products of the type 
described in Section 2. 
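
To fix ideas, here is a small worked example that is not taken from the paper: the additive group $G = C$ regarded as a complex analytic group. The description of $R$ below is the standard one for this group; the coordinate function $z$ and the polynomials $p_i$ are notation introduced for the illustration.

```latex
% Finite dimensional analytic representations of G = C have the form z -> exp(zA),
% whose matrix entries are sums of terms z^k e^{cz}. Hence
\[ R(G) = \mathrm{span}_{C}\{\, z^{k} e^{cz} : k \ge 0,\ c \in C \,\}. \]
\[ \mathrm{Hom}(G, C) = \{\, z \mapsto cz : c \in C \,\}, \qquad
   Q = \exp(\mathrm{Hom}(G, C)) = \{\, e^{cz} : c \in C \,\}. \]
% The polynomial algebra B = C[z] satisfies conditions (1)-(3) of a basic subalgebra:
\[ C \subset B, \qquad B[Q] = R, \qquad
   \sum_{i} p_i(z)\, e^{c_i z} = 0 \ \ (c_i \text{ distinct},\ p_i \in B)
   \ \Longrightarrow\ p_i = 0 \text{ for all } i . \]
% B is stable under the translations z -> z + a (left and right coincide, G being abelian).
```

In this example the non-triviality of $Q$ is visible directly: the exponentials $e^{cz}$ are exactly the part of $R$ that is not polynomial in the coordinate.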

In Section 6, we analyze the group of the proper automorphisms of $R$
(i.e., the automorphisms leaving the constants fixed and commuting with the
right translations) by means of a stable basic subalgebra. In particular, we
show that this group is a semidirect product of a subgroup naturally isomorphic
with $\mathrm{Hom}(Q, C^*)$ by the normal subgroup of the left translations (complexified,
in the real case). Section 7 is a direct application of the existence
of stable basic subalgebras and concerns the representations of $G$ as closed
subgroups of full linear groups.

In Section 8, we show that the two-sidedly stable basic subalgebras (which
do not always exist) correspond in a 1-1 fashion to the rational equivalence
classes of faithful representations of the complex analytic group $G$ as an
algebraic linear group. In particular, it follows from this and the correspondence
between normal basic subalgebras and decompositions of $G$ that any
two such 'algebraic structures' of $G$ are conjugate by an analytic automorphism
of $G$, whence we obtain a description of the set of all algebraic
structures on $G$ which exhibits this set as an affine space in a natural way.
On the level of the decomposition theory of $G$, the conjugacy result is due
to B. Kostant (Theorem 8.4).

The statements of the results obtained here and the main features of 
their proofs are intelligible without reference to our previous papers ([2], 
[3], [4]) on this topic. Nevertheless, we lean heavily on the notions and 
techniques of these papers, and we have not covered all the details of the 
proofs by explicit references. 


2. Nuclei and decompositions. For later use, we review some known 
results concerning decompositions of analytic groups into semidirect products. 

We shall say that a (real or complex) Lie group $G$ is reductive if $G$ has
a faithful finite dimensional analytic representation and if every finite dimensional
analytic representation of $G$ is semisimple. By a nucleus of a Lie
group $G$ we shall mean a closed, normal, solvable and simply connected
analytic subgroup $K$ of $G$ such that $G/K$ is reductive. Let $N$ denote the
radical of the commutator subgroup $G'$ of $G$. Then every finite dimensional
analytic representation of $G$ is unipotent on $N$. Now suppose that $G$ has a
nucleus $K$. Then $G$ has a finite dimensional semisimple analytic representation
whose kernel is exactly $K$. Since the restriction of a semisimple
representation to a normal subgroup is still semisimple, we conclude that
$N \subset K$. Thus $N$ is contained in every nucleus of $G$.


ANALYTIC GROUP. 113 


If G is an analytic group that has a faithful finite dimensional analytic 
representation then G has a nucleus. Moreover, if K is any nucleus of G 
then G is a semidirect product H-K, where H is a closed analytic subgroup 
of G and, of course, is reductive. 


In the complex case, the existence of a nucleus $K$ for an analytic group
$G$ implies the existence of a semidirect product decomposition $G = H \cdot K$,
even when it is not assumed that $G$ is faithfully representable, and the existence
of a faithful representation is then a consequence of the existence of a nucleus
[4, Th. 3.6]. In the real case, the proof of the second assertion above is
contained in the proof of [2, Th. 9.1]; one merely has to observe that any
given nucleus of $G$ may take the place of the group $K$ used in that proof.
The first of our two assertions above, in the real case, is part of [2, Th. 9.1];
in the complex case, the existence of a nucleus is part of [4, Th. 4.2].

In our later applications of nuclei, we shall use the fact that the Lie 
algebra of K can be written as a sum of the Lie algebra of N and another 
nilpotent Lie algebra that lies in the centralizer of the Lie algebra of H. 
What we shall need is contained in the following lemma. 


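Lemma 2.1. Let $\mathfrak{G}$ be a Lie algebra that is a semidirect sum $\mathfrak{H} + \mathfrak{K}$,
where $\mathfrak{K}$ is a solvable ideal and $\mathfrak{H}$ is a complementary subalgebra that is
reductive in $\mathfrak{G}$. Let $\mathfrak{N} = [\mathfrak{G}, \mathfrak{K}]$. Then there is a nilpotent subalgebra $\mathfrak{P}$
of $\mathfrak{K}$ such that $[\mathfrak{H}, \mathfrak{P}] = (0)$ and $\mathfrak{K} = \mathfrak{P} + \mathfrak{N}$ (not necessarily semidirect).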


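Proof. Let $\mathfrak{L}$ denote the centralizer of $\mathfrak{H}$ in $\mathfrak{K}$. Since $\mathfrak{K}$ is semisimple
as an $\mathfrak{H}$-module (under the adjoint representation), it is clear that $\mathfrak{K} = \mathfrak{L} + \mathfrak{N}$.
For $x \in \mathfrak{L}$, denote by $\mathfrak{L}^x$ the subspace of all elements of $\mathfrak{L}$ that are annihilated
by some power of the inner derivation effected by $x$. If we choose $x$ so that
$\mathfrak{L}^x$ is of the smallest possible dimension then $\mathfrak{L}^x$ (is a Cartan subalgebra of
$\mathfrak{L}$ and, in particular,) is a nilpotent subalgebra $\mathfrak{P}$ of $\mathfrak{L}$. By Fitting's Lemma,
we have $\mathfrak{L} = \mathfrak{P} + \mathfrak{S}$, where $\mathfrak{S}$ is a subspace such that $[x, \mathfrak{S}] = \mathfrak{S}$. Hence
$\mathfrak{S} \subset \mathfrak{N}$, and we conclude that $\mathfrak{K} = \mathfrak{P} + \mathfrak{N}$, completing the proof.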


3. Basic subalgebras. We begin with two elementary facts concerning 
basic subalgebras that are important for our purpose. 


Proposition 3.1. Let $U$ and $V$ be any two basic subalgebras of $R$.
Then there exists a unitary $C$-algebra isomorphism of $U$ onto $V$.


Proof. For every $f \in R$, write $f = \sum_{q \in Q} v_q(f) q$ with $v_q(f) \in V$. Similarly,
define the maps $u_q: R \to U$, for each $q \in Q$. Now define the map $\varphi: U \to V$
by $\varphi(f) = \sum_{q \in Q} v_q(f)$. Then $\varphi$ is evidently a unitary $C$-algebra homomorphism




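of $U$ into $V$. Interchanging the roles of $U$ and $V$, we obtain a unitary $C$-algebra
homomorphism $\psi: V \to U$. Let $f \in U$. Then we have

$$\begin{aligned}
\psi(\varphi(f)) &= \sum_{q} u_q(\varphi(f)) = \sum_{q, q'} u_q(v_{q'}(f)) \\
&= \sum_{q, q'} u_q(v_{q'}(f) q') \\
&= \sum_{q} u_q\Big(\sum_{q'} v_{q'}(f) q'\Big) \\
&= \sum_{q} u_q(f) = f.
\end{aligned}$$

Thus $\psi \circ \varphi$ is the identity map on $U$, and similarly $\varphi \circ \psi$ is the identity map
on $V$. Hence $\varphi$ is a $C$-algebra isomorphism of $U$ onto $V$.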
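
Some steps of this proof are left to the reader and can be sketched as follows, using only the uniqueness of the expansions of an element of $R$ over $U$ and over $V$ (condition (3)) and the fact that $Q$ is a group. The auxiliary names $g$, $p$, $f_1$, $f_2$ are introduced here for illustration.

```latex
% (a) Multiplying by q' \in Q permutes the coefficients; this justifies the
%     passage from u_q(v_{q'}(f)) to u_q(v_{q'}(f) q') after summing over q:
\[ g = \sum_{q} u_q(g)\, q \ \Longrightarrow\ g q' = \sum_{q} u_q(g)\,(q q')
   \ \Longrightarrow\ u_{qq'}(g q') = u_q(g)
   \ \Longrightarrow\ \sum_{q} u_q(g q') = \sum_{q} u_q(g). \]
% (b) \varphi is multiplicative on U:
\[ f_1 f_2 = \sum_{q,q'} v_q(f_1)\, v_{q'}(f_2)\, q q'
   \ \Longrightarrow\ v_p(f_1 f_2) = \sum_{q q' = p} v_q(f_1)\, v_{q'}(f_2), \]
\[ \varphi(f_1 f_2) = \sum_{p} v_p(f_1 f_2)
   = \Big(\sum_{q} v_q(f_1)\Big)\Big(\sum_{q'} v_{q'}(f_2)\Big) = \varphi(f_1)\,\varphi(f_2). \]
% (c) For f \in U the expansion over U is f = f \cdot 1, so
\[ u_1(f) = f, \qquad u_q(f) = 0 \ \ (q \ne 1). \]
```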


Proposition 3.2. Let $U$ be a basic subalgebra of $R$ that is stable under
the left (or right) translations with the elements of $G$. Then $\mathrm{Hom}(G, C) \subset U$.


Proof. Let $h \in \mathrm{Hom}(G, C)$ and write $h = \sum_{q \in Q} u_q(h) q$ with $u_q(h) \in U$.
Translating on the left with an element $x \in G$, we obtain from this

$$h(x) + h = \sum_{q \in Q} (x \cdot u_q(h))\, q(x)\, q.$$

Comparing coefficients, we get

$$u_q(h) = (x \cdot u_q(h))\, q(x), \quad \text{for every } q \neq 1.$$

Now evaluate at the identity element 1 of $G$. This yields

$$u_q(h)(1) = u_q(h)(x)\, q(x).$$

Thus, for every $q \neq 1$, $u_q(h) q$ is a constant. We conclude that $u_q(h) = 0$,
for every $q \neq 1$, whence $h = u_1(h) \in U$, q.e.d.
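
The last inference, from "$u_q(h) q$ is a constant" to "$u_q(h) = 0$", rests on condition (3) of the definition of a basic subalgebra. A sketch of that step follows, with the constant written as $c_q$ (a name introduced here):

```latex
% For q \ne 1 put c_q = u_q(h)\, q, a constant function. Then, in R,
\[ u_q(h) \cdot 1 \;-\; c_q \cdot q^{-1} \;=\; 0, \qquad u_q(h),\ c_q \in U . \]
% Since q \ne 1, the elements 1 and q^{-1} of Q are distinct, and Q is free over U:
\[ u_q(h) = 0 \quad\text{and}\quad c_q = 0 \qquad (q \ne 1), \]
\[ h = \sum_{q \in Q} u_q(h)\, q = u_1(h) \in U . \]
```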

In studying the algebra $R = R(G)$ of the complex valued representative functions on the (real or complex) analytic group $G$, we may assume (in virtue of [2, Th. 7.1] and [4, pp. 89-90]) without loss of generality that $G$ has a faithful finite dimensional analytic representation. This assumption will be in force from now on. Let $G = H \cdot K$ be a semidirect decomposition as discussed in Section 2. Let $\mathfrak{G}$, $\mathfrak{H}$, $\mathfrak{K}$ be the Lie algebras of $G$, $H$, $K$, respectively. Let $N$ be the radical of the commutator subgroup $G'$ of $G$. Then $N \subset K$, and the Lie algebra $\mathfrak{N}$ of $N$ coincides with $[\mathfrak{G}, \mathfrak{K}]$. Let $x_1, \dots, x_m$ be a basis for $\mathfrak{N}$. Let $x_{m+1}, \dots, x_n$ be elements of the nilpotent algebra $\mathfrak{P}$ of Lemma 2.1 such that $x_1, \dots, x_n$ is a basis for $\mathfrak{K}$. Now every element of $G$ can be written uniquely in the form

$$h \exp(c_n x_n) \cdots \exp(c_{m+1} x_{m+1}) \exp\Big(\sum_{i=1}^{m} c_i x_i\Big),$$

where $h \in H$ and the $c_i$ are real or complex numbers (note that $N$ is simply connected and nilpotent). We define functions $u_1, \dots, u_n$ on $G$ such that, for each $j$, the value of $u_j$ at the element of $G$ written above is $c_j$. Let $S$ be the algebra of functions that is generated by the constants and $u_1, \dots, u_n$. Let $R^K$ denote the subalgebra of $R$ that consists of the elements left fixed by the translations with the elements of $K$. The analysis of $R$ made in [3, Section 4], which would be the same in the complex case as it was in the real case, has shown that $R^K S$ is a basic subalgebra of $R$ (the notation of [3] is such that our present $R^K$ is there denoted R4%(G); the notation we adopt here is based on the principle that if $W$ is any left module for a group $A$ then $W^A$ stands for the submodule of the $A$-fixed elements of $W$).

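We claim that $S$ is stable under the left translations with the elements of $G$. For $j > m$, we have $u_j \in \mathrm{Hom}(G, C)$, so that $g \cdot u_j = u_j(g) + u_j \in S$, for every $g \in G$. Now suppose that $j \leq m$. Let $v \in H$. Then $v$ commutes with every $\exp(c_k x_k)$ with $k > m$. Hence, if $t \in \mathfrak{N}$ and $v^*$ denotes the automorphism of $\mathfrak{N}$ that corresponds to $v^{-1}$ under the adjoint representation,

$$
\begin{aligned}
h \exp(c_n x_n) \cdots \exp(c_{m+1} x_{m+1}) \exp(t)\, v
&= h v \exp(c_n x_n) \cdots \exp(c_{m+1} x_{m+1})\, v^{-1} \exp(t)\, v \\
&= h v \exp(c_n x_n) \cdots \exp(c_{m+1} x_{m+1}) \exp(v^*(t)).
\end{aligned}
$$

We see at once from this that $v \cdot u_j$ is a linear combination of $u_1, \dots, u_m$.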


Now let $s \in \mathfrak{N}$, and let us consider the translate $\exp(s) \cdot u_j$. The nilpotency of $N$ implies that, if $t \in \mathfrak{N}$, we have

$$\exp(t) \exp(s) = \exp(f(s, t)),$$

where $f$ is a polynomial map of $(\mathfrak{N}, \mathfrak{N})$ into $\mathfrak{N}$. Hence it is clear that $\exp(s) \cdot u_j$ is a polynomial in $u_1, \dots, u_m$.

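There remains to consider the translates $\exp(d_k x_k) \cdot u_j$, where $k > m$ and $d_k$ is an arbitrary real or complex number. Since $\mathfrak{P}$ is nilpotent, we have

$$\exp(c_n x_n) \cdots \exp(c_{m+1} x_{m+1}) \exp(d_k x_k) = \exp(c_n x_n) \cdots \exp((c_k + d_k) x_k) \cdots \exp(c_{m+1} x_{m+1}) \exp(s),$$

where $s$ is a linear combination of basis elements of $[\mathfrak{P}, \mathfrak{P}]$ whose coefficients are polynomials in $c_{m+1}, \dots, c_n$ and $d_k$. Writing $t$ for $\sum_{i=1}^{m} c_i x_i$, we have

$$h \exp(c_n x_n) \cdots \exp(c_{m+1} x_{m+1}) \exp(t) \exp(d_k x_k) = h \exp(c_n x_n) \cdots \exp((c_k + d_k) x_k) \cdots \exp(c_{m+1} x_{m+1}) \exp(s) \exp(t'),$$

where $t' \in \mathfrak{N}$ is determined by $\exp(t') = \exp(-d_k x_k) \exp(t) \exp(d_k x_k)$. Since $s \in \mathfrak{N}$, the product of the last two factors is, by the polynomial multiplication formula above, the exponential of an element of $\mathfrak{N}$ which is a linear combination of $x_1, \dots, x_m$ whose coefficients are polynomials in $c_1, \dots, c_n$ (the dependence on $d_k$ being ignored). Hence $\exp(d_k x_k) \cdot u_j$ is a polynomial in $u_1, \dots, u_n$.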


Thus we have shown that $S$ is stable under the left translations. Evidently, $R^K$ is stable under the left and the right translations. Moreover, $R^K$ is canonically isomorphic with the algebra of all representative functions on the reductive analytic group $G/K$. By [2, Th. 9.2] (for the real case) and [4, Th. 5.2] (for the complex case), $R^K$ is therefore finitely generated as a C-algebra. Hence $R^K S$ is finitely generated as a C-algebra. Put $B = R^K S$.

It is easy to see (cf. [3, Section 4]) that the canonical map $R^K \otimes_C S \to R^K S$ is an isomorphism. We recall that a representative function $f$ on $G$ is called semisimple if the representation of $G$ by left translations on the space spanned by the translates of $f$ is semisimple. The semisimple elements of $B$ constitute a subalgebra $B_s$ of $B$ that is stable under the left translations. We claim that $B_s = R^K$. Evidently, $R^K \subset B_s$. Conversely, let $f \in B_s$. We can write $f = \sum_{i=1}^{r} p_i s_i$, where $p_1, \dots, p_r$ are C-linearly independent elements of $R^K$ and the $s_i \in S$. Then we have, for every $x \in K$, $x \cdot f = \sum_{i=1}^{r} p_i (x \cdot s_i)$. Since, for given $p_1, \dots, p_r$, this representation of $x \cdot f$ is unique, it follows that, for each $i$, the $K$-module spanned by the left $K$-translates of $s_i$ is a $K$-homomorphic image of the $K$-module spanned by the left $K$-translates of $f$. Since $K$ is normal in $G$, the $K$-module generated by $f$ is semisimple. Hence the $K$-module generated by $s_i$ is semisimple, for each $i$. On the other hand, $N$ is unipotent on every finite dimensional $N$-submodule of $R$. Hence $s_i$ must be left fixed by the left $N$-translations, and it follows that the restriction of $s_i$ to $K$ may be regarded as a representative function on the vector group $K/N$ and, as such, is a semisimple representative function on $K/N$. Hence the restriction of $s_i$ to $K$ is a C-linear combination of elements of $\exp(\mathrm{Hom}(K/N, C))$, i.e., it coincides with the restriction to $K$ of a C-linear combination of elements of $Q = \exp(\mathrm{Hom}(G, C))$. However, it is clear from [3, Section 4] that the restriction homomorphism $R(G) \to R(K)$ is a monomorphism on $S[Q]$. Thus we conclude that $s_i \in C[Q]$. Since the elements of $Q$ are free over $S$, this implies that $s_i$ is a constant, whence $f \in R^K$.


It is convenient to introduce the following definition: a normal basic subalgebra of $R$ is a basic subalgebra $B$ such that $B$ is stable under the left translations and $B_s$ is stable under both the left and the right translations.


The algebra $B$ constructed above has been shown to be a normal basic subalgebra. Since $B$ is finitely generated as a C-algebra, it is clear from Proposition 3.1 that every basic subalgebra of $R$ is finitely generated as a C-algebra. Next, we observe that the kernel of the representation of $G$ by left translations on $B_s$ is the nucleus $K$ of $G$. Indeed, since $B_s = R^K$, this kernel contains $K$. Since $G = H \cdot K$, and since the representation of $H$ by left translations on $R^K$ must be faithful (because $H$ has a faithful finite dimensional representation and $R^K$ is canonically isomorphic with the algebra of all representative functions on $H$), it follows that the kernel must coincide with $K$.

Now let $D$ be any basic subalgebra of $R$ that is stable under the left translations. We claim that $R_s = D_s[Q]$. Evidently, $D_s[Q] \subset R_s$. Conversely, let $f \in R_s$ and write $f = \sum_{q \in Q} d_q(f)\, q$ with $d_q(f) \in D$. Then we have, for every $x \in G$, $x \cdot f = \sum_{q \in Q} (x \cdot d_q(f))\, q(x)\, q$, and $x \cdot d_q(f) \in D$. Denote by $G \cdot f$ the space spanned by the left translates of $f$, etc. Then $(G \cdot d_q(f))\, q$ is a $G$-homomorphic image of $G \cdot f$, for each $q$, and hence is semisimple. Evidently, the $G$-submodules of $(G \cdot d_q(f))\, q$ are the $G$-modules $Vq$, where $V$ ranges over the $G$-submodules of $G \cdot d_q(f)$. Hence we conclude that each $d_q(f)$ is semisimple, i.e., that $d_q(f) \in D_s$. Hence $f \in D_s[Q]$, and our claim is proved.

Let $\phi$ denote the coefficient sum isomorphism of $B$ onto $D$, as in Proposition 3.1. It follows at once from what we have just seen that $\phi(B_s) \subset D_s$. Similarly, $\phi^{-1}(D_s) \subset B_s$. Hence $\phi$ maps $R^K = B_s$ isomorphically onto $D_s$. For $x \in G$, define the map $\phi_x \colon R^K \to C$ by $\phi_x(f) = \phi(f)(x)$. Then $\phi_x$ is an algebra homomorphism leaving the constants fixed. Furthermore, in the real case, $\phi_x$ evidently commutes with the complex conjugation. By [2, Prop. 2.5], $\phi_x$ defines a unique proper automorphism $\xi_x$ of $R^K$ such that $\xi_x(f)(1) = \phi_x(f)$. In the real case, $\xi_x$ commutes with the complex conjugation. Moreover, it is clear that the map $x \to \xi_x$ is continuous, so that, in the real case, $\xi_x$ belongs to the connected component of the identity in the group of the real proper automorphisms of $R^K$. Now $R^K$ may be regarded as the algebra of all representative functions on the reductive analytic group $H$. Hence it follows from [2, Th. 1.1.1] (for the real case) and [4, Th. 5.2] (for the complex case) that $\xi_x$ is the left translation by an element $x_1 \in H$. Thus we have $\phi(f)(x) = f(x_1)$, for every $f \in R^K$.

Now let $d \in D_s$, and write $d = \sum_{q \in Q} f_q\, q$ with $f_q \in R^K$. Then $d(x_1) = \sum_{q} f_q(x_1)$, because $q(x_1) = 1$, for each $q$, since $x_1 \in H$. Thus $d(x_1) = \sum_{q} \phi(f_q)(x) = d(x)$, by the proof of Proposition 3.1.

Now suppose that $D$ is a normal basic subalgebra. Then, for every $d \in D_s$ and every $y \in G$, $d \cdot y \in D_s$, and hence $(d \cdot y)(x_1) = (d \cdot y)(x)$. Hence we have $x_1 \cdot d = x \cdot d$, for every $d \in D_s$. Now let $L$ denote the kernel of the representation of $G$ by left translations on $D_s$. Then our last result shows that $G = HL$. Moreover, the isomorphism $\phi \colon R^K \to D_s$ is evidently an $H$-module isomorphism. Since the representation of $H$ by left translations on $R^K$ is faithful, it follows that the representation of $H$ by left translations on $D_s$ is also faithful. Hence $H \cap L = (1)$, and $G$ is the semidirect product $H \cdot L$. Clearly, $N \subset L$, and $L/N$ is isomorphic with $G/(HN)$ and hence with $K/N$. Hence $L$ is solvable. Moreover, since $H \cdot L = H \cdot K$, it follows that $L$ is homeomorphic with $K$, and thus is simply connected. Hence $L$ is a nucleus of $G$. Now construct a normal basic subalgebra $E$ of $R$ as above, but using the nucleus $L$. Then $E_s = R^L$, and $D_s[Q] = R_s = E_s[Q]$. Since $D_s \subset E_s$ and since the elements of $Q$ are free over $E_s$, this implies that $D_s = E_s = R^L$.


We may now summarize our results as follows. 


THEOREM 3.1. Let $G$ be an analytic group having a faithful finite dimensional analytic representation. Let $R$ be the algebra of the representative functions on $G$, and let $K$ be a nucleus of $G$. Then there exists a normal basic subalgebra $B$ of $R$ such that $B_s = R^K$ and $K$ is the kernel of the representation of $G$ by left translations on $B_s$. If $D$ is any normal basic subalgebra of $R$ then the kernel of the representation of $G$ by left translations on $D_s$ is a nucleus $L$ of $G$, and $D_s = R^L$.


We observe that if $G$ is solvable then every basic subalgebra of $R$ that is stable under the left translations is a normal basic subalgebra. Indeed, if $G$ is solvable we have $G' = N$, so that $G/N$ is abelian. Now if $B$ is a left stable basic subalgebra of $R$ then the elements of $B_s$ are left fixed by the left translations with the elements of $N$. It follows that, for every $f \in B_s$ and every $x \in G$, $f \cdot x = x \cdot f$, which proves our assertion.

In general, this last result does not hold, as is shown by the following example. Let $H$ denote the group of all 2 by 2 complex matrices with determinant 1, and let $G = H \times C$. Let $\alpha, \beta, \gamma, \delta$ be the functions on $G$ that associate with each element of $G$ the entries of the matrix component of that element, so that $\alpha\delta - \beta\gamma = 1$. Let $p$ be the projection of $G$ onto $C$ with kernel $H$, and put $q = \exp(p)$. Then it is easily seen that a left stable basic subalgebra of $R$ is given by

$$B = C[\alpha q, \beta q, \gamma q^{-1}, \delta q^{-1}, p].$$

We have

$$B_s = C[\alpha q, \beta q, \gamma q^{-1}, \delta q^{-1}],$$

and one verifies directly that $B_s$ is not stable under the right translations. Moreover, the kernel of the representation of $G$ by left translations on $B_s$ is discrete, in this case, and thus is certainly not a nucleus of $G$.
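The direct verification can be sketched as follows. This computation is not in the paper and rests on three assumptions: the generators of $B_s$ are $\alpha q$, $\beta q$, $\gamma q^{-1}$, $\delta q^{-1}$ (the printed list is damaged), the entries are arranged as rows $(\alpha, \beta)$ and $(\gamma, \delta)$, and the translation conventions are $(x \cdot f)(y) = f(yx)$ and $(f \cdot x)(y) = f(xy)$, as in the computations of this section. Write $x = (m, c)$ with $m = (m_{ij})$ of determinant 1 and $c \in C$.

```latex
% Hedged sketch: generators, entry layout and translation conventions are assumptions.
\begin{aligned}
x \cdot (\alpha q) &= e^{c}\,\big(m_{11}\, \alpha q + m_{21}\, \beta q\big) \in B_s,\\
(\alpha q) \cdot x &= e^{c}\,\big(m_{11}\, \alpha q + m_{12}\, \gamma q\big),
  \qquad \gamma q = (\gamma q^{-1})\, q^{2},\\
\gamma q \in B &\;\Longrightarrow\; (\gamma q) \cdot 1 - (\gamma q^{-1}) \cdot q^{2} = 0
  \ \text{is a nontrivial relation over } B .
\end{aligned}
```

Since the elements of $Q$ are free over $B$, the last relation is impossible, so $(\alpha q) \cdot x \notin B_s$ whenever $m_{12} \neq 0$. For the kernel, $x$ acts trivially on all four generators only if $m = \lambda I$ with $\lambda e^{c} = 1 = \lambda e^{-c}$, which forces $\lambda = \pm 1$ and $c \in \pi i \mathbb{Z}$, a discrete set.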


4. Relations among normal basic subalgebras.


LEMMA 4.1. Let $B$ be any normal basic subalgebra of $R$, and let $K$ be the kernel of the representation of $G$ by left translations on $B_s$. Then $B^N = R^K[\mathrm{Hom}(G, C)]$.


Proof. From a semidirect product decomposition $G = H \cdot K$, we see that $G/N$ is the direct product of the reductive group $(HN)/N$ by the vector group $K/N$. Hence $R(G/N)$ may be identified with the tensor product of $R(G/N)^{K/N}$ and $R(G/N)^{(HN)/N}$. Now $R(G/N)^{(HN)/N}$ is canonically isomorphic with $R(K/N) = C[\mathrm{Hom}(K/N, C), \exp(\mathrm{Hom}(K/N, C))]$. Hence we have

$$R(G/N)^{(HN)/N} = C[\mathrm{Hom}(G/N, C), \exp(\mathrm{Hom}(G/N, C))].$$

Now if we apply the canonical isomorphism of $R(G/N)$ onto $R(G)^N$ we find

$$R^N = R^K \otimes_C C[\mathrm{Hom}(G, C), Q] = R^K[\mathrm{Hom}(G, C), Q].$$

Thus we have

$$R^K[\mathrm{Hom}(G, C)] \subset B^N \subset R^K[\mathrm{Hom}(G, C)][Q].$$

Since the elements of $Q$ are free over $B^N$, this implies that

$$B^N = R^K[\mathrm{Hom}(G, C)].$$


Now let $B$ be the normal basic subalgebra of $R$ constructed from the nucleus $K$ as in Section 3. Then $B = R^K[u_1, \dots, u_n]$, and $(u_{m+1}, \dots, u_n)$ is a C-basis for $\mathrm{Hom}(G, C)$. Hence $B = B^N[u_1, \dots, u_m]$. Let $Z$ denote the center of $N$. The elements $u_1, \dots, u_m$ were defined with reference to a basis $x_1, \dots, x_m$ of the Lie algebra of $N$. Choose this basis so that $x_1, \dots, x_p$ is a basis for the Lie algebra of $Z$. Then it is clear from the definition of the functions $u_i$ that $u_i \in B^Z$, for every $i > p$. Hence

$$B = B^Z[u_1, \dots, u_p].$$

If we consider the natural action of the Lie algebra of $Z$ on $B$, we see immediately that, for $i = 1, \dots, p$, $x_i$ annihilates $B^Z$ while $x_i(u_j) = \delta_{ij}$. Hence we see from the familiar partial differentiation argument that the monomials in the functions $u_1, \dots, u_p$ are free over $B^Z$. Moreover, if we examine the argument of Section 3 with which we showed that $S$ is stable under the left translations, we see that, actually, the free $B^Z$-module

$$B^Z + B^Z u_1 + \cdots + B^Z u_p$$

is stable under the left translations.
Now we are in a position to prove the following theorem. 


THEOREM 4.1. Let $G$ be an analytic group, $R$ the algebra of the representative functions on $G$. Let $A$ and $B$ be two normal basic subalgebras of $R$ such that $A_s = B_s$. Then there is an element $x \in N$ such that $B \cdot x = A$.


Proof. We make an induction on the dimension of $N$. Let $Z$ be the center of $N$. If $Z = N$ we have $A^Z = B^Z$, by Lemma 4.1. If $Z \neq N$ we consider $G/Z$, identifying $R(G/Z)$ with $R^Z$. Then $B^Z$ and $A^Z$ become identified with normal basic subalgebras of $R(G/Z)$, and $(B^Z)_s = B_s = A_s = (A^Z)_s$. The radical of $(G/Z)'$ is evidently $N/Z$. Hence, assuming that the theorem is proved in lower dimensions (to start the induction, note that if $N$ is trivial then $A = B$, by Lemma 4.1), there is an element $x \in N$ such that $(B^Z) \cdot x = A^Z$. Replacing $A$ by $A \cdot x^{-1}$ we may therefore assume that $B^Z = A^Z$. Moreover, it evidently suffices to prove the theorem in the case where $B$ is as described above, which we shall now assume.

Every element $f \in R$ may be written uniquely in the form

$$f = \sum_{q \in Q} a_q(f)\, q$$

with $a_q(f) \in A$. Then $a_1$ is evidently a $G$-module homomorphism (but not necessarily an algebra homomorphism) of $R$ into $A$, and $a_1$ is the identity map on $B^Z = A^Z$. Let $\alpha$ denote the restriction of $a_1$ to $B^Z + B^Z u_1 + \cdots + B^Z u_p$. Since the monomials in $u_1, \dots, u_p$ are free over $B^Z$, we can extend $\alpha$ to an algebra homomorphism $\beta \colon B \to A$ which, like $\alpha$, commutes with the left translations. We extend $\beta$ to an algebra endomorphism $\gamma$ of $R$ such that $\gamma(q) = q$, for every $q \in Q$. Clearly, $\gamma$ still commutes with the left translations. Furthermore, since $\mathrm{Hom}(G, C) \subset B^Z$, $\gamma$ leaves the elements of $\mathrm{Hom}(G, C)$ fixed. Hence $\gamma(\exp(f)) = \exp(\gamma(f))$, for every $f \in \mathrm{Hom}(G, C)$. Finally, we note that, in the real case, $a_1$ commutes with the complex conjugation, whence also $\gamma$ commutes with the complex conjugation.

In the complex case, it follows at once from [4, Th. 5.1] (with left and right translations interchanged) that $\gamma$ is the right translation by an element $x \in G$. Since $\gamma$ leaves the elements of $B^Z$ fixed, we have, in particular, $f \cdot x = f$, for every $f \in R^K[\mathrm{Hom}(G, C)] = B^N$. From this we conclude first that $x \in K$ (because $R^K$ separates the elements of $G/K$) and then that $x \in N$ (because $\mathrm{Hom}(G, C)$ separates the elements of $K/N$).

In the real case, we appeal to [3, Th. 5.1] to conclude that there is an element $x$ in the universal complexification $G^+$ of $G$ such that $\gamma(f) = f \cdot x$, for every $f \in R$. Now $G^+$ contains the universal complexification $N^+$ of $N$, and it follows as just above, from the fact that $\gamma$ leaves the elements of $R^K[\mathrm{Hom}(G, C)]$ fixed, that $x \in N^+$. Finally, since $\gamma$ commutes with the complex conjugation, it follows that $x \in N$.

Thus, in either case, there is an element $x \in N$ such that $B \cdot x \subset A$. Since both $B \cdot x$ and $A$ are basic subalgebras of $R$, this implies that $B \cdot x = A$, so that our theorem is proved.

Theorems 3.1 and 4.1 give a one to one correspondence between the set of nuclei of $G$ and the set of right $G$-orbits of normal basic subalgebras of $R$. In particular, the two-sidedly $G$-stable subalgebras of $R$ that are generated by the normal basic subalgebras associated with a given nucleus $K$ of $G$ all coincide with one and the same two-sidedly $G$-stable finitely generated subalgebra of $R$, which is thus invariantly associated with the nucleus $K$. In the general case, the representation-theoretical significance of this is not clarified. Moreover, this correspondence is not reversible; the same two-sidedly stable subalgebra may be associated with several, even non-isomorphic, nuclei. This is shown by the following example.

Let $G$ be the group of 7-tuples $(a, b, c, z, r, s, t)$, where $z, r, s, t$ are arbitrary complex numbers, $a, b, c$ are non-zero complex numbers, and the multiplication is given by
= (aa’, 


For every integer $n$, we define a nucleus $K_n$ of $G$; $K_n$ consists of the elements $(\exp(z), \exp(-z), \exp(nz), z, r, s, t)$, where $z, r, s, t$ range over all complex numbers. We define the functions $\alpha, \beta, \gamma, \zeta, \rho, \sigma, \tau$ on $G$ by $\alpha(a, b, c, z, r, s, t) = a$, etc. Let $q = \exp(\zeta)$. It is easily seen that $R^{K_n}$ is generated by the functions $\alpha q^{-1}$, $\beta q$, $\gamma q^{-n}$ and their reciprocals, and that $R^{K_n}[\zeta, \rho, \sigma, \tau]$ is a normal basic subalgebra of $R$; in fact, it results from the construction of Section 3. Now one verifies easily that the two-sidedly stable subalgebra generated by this
normal basic subalgebra is C[, B, B1,y7,¢,p,0,7]. This is the 
same for all $n$. On the other hand, if $n \neq 0$ then $K_n$ is not isomorphic with $K_0$, because $(K_n)'$ is of dimension 3 while $(K_0)'$ is of dimension 2.


5. The unipotent hull. For our present purpose, it will be convenient to assume, to begin with, that $G$ is a complex analytic group. Let $A$ be the group of all proper automorphisms of $R$, and let $U$ denote the kernel of the natural representation of $A$ on $R_s$. We shall call $U$ the unipotent hull of $G$.

We claim that, for every finite dimensional left $G$-stable (and hence also $A$-stable) subspace $S$ of $R$, the natural representation of $U$ on $S$ is unipotent. In order to see this, let us consider a composition series $(0) = S_0 \subset S_1 \subset \cdots \subset S_k = S$ for $S$ as a $G$-module. Let $t_i$ be a linear function on $S_i$ that vanishes on $S_{i-1}$, and let $s_i \in S_i$. Then the function $t_i/s_i$, where $(t_i/s_i)(x) = t_i(x \cdot s_i)$, for every $x \in G$, is a representative function associated with the representation of $G$ on $S_i/S_{i-1}$, and hence belongs to $R_s$. Hence $u(t_i/s_i) = t_i/s_i$, for every $u \in U$, whence $t_i(u(s_i)) = t_i(s_i)$, for every $u \in U$. Since this holds for all $t_i$ that vanish on $S_{i-1}$, and since $S_i$ is $U$-stable, this implies that $u(s_i) - s_i \in S_{i-1}$, for every $u \in U$. Hence $S$ is unipotent as a $U$-module.

We may choose a finite dimensional two-sidedly stable subspace $S$ of $R$ such that $S$ and the elements of $Q$ generate $R$ and the representation of the subgroup $GU$ of $A$ on $S$ is faithful. This is done as follows: let $f_1, \dots, f_n$ be a maximal set of linearly independent elements of $\mathrm{Hom}(G, C)$, and put $f_0 = c_1 f_1 + \cdots + c_n f_n$, where $c_1, \dots, c_n$ are rationally independent complex numbers. Put $q_i = \exp(f_i)$. Then, if $K$ is any nucleus of $G$, $q_0, \dots, q_n$ separate the elements of $K/N$. There is a finite subset $T$ of $R$ containing a set of generators of $R^K$ and such that $T \cup Q$ generates $R$. We let $S$ be the smallest two-sidedly stable subspace of $R$ that contains $\mathrm{Hom}(G, C)$, $T$, and $q_0, \dots, q_n$. We claim that $S$ satisfies our requirements.

There remains only to show that the representation of $GU$ on $S$ is faithful. Suppose that $x \in G$, $u \in U$, and $xu$ leaves the elements of $S$ fixed. Write $G = H \cdot K$, with $H$ reductive, and $x = hk$, with $h \in H$ and $k \in K$. We have $xu(q_i) = x \cdot q_i = q_i(k)\, q_i$. Since $q_i \in S$, we have $xu(q_i) = q_i$. Hence we conclude that $q_i(k) = 1$, for $i = 0, 1, \dots, n$. By the choice of the $q_i$, this implies that $k \in N$. Since $N \subset U$, this means that we may now assume that $x \in H$. If $f \in R^K$ then $f \in R_s$, so that $xu(f) = x \cdot f$. On the other hand, $f$ belongs to the algebra generated by $S$, whence $xu(f) = f$. Thus $x \cdot f = f$, for every $f \in R^K$. But this implies, since $x \in H$, that $x = 1$. Now $\mathrm{Hom}(G, C) \subset S$, so that $u(f) = f$, for every $f \in \mathrm{Hom}(G, C)$. Since $\exp(f) \in R_s$, we have $u(\exp(f)) = \exp(f)$. By [4, Th. 5.1], it follows that $u$ is the left translation by an element of $G$. Since $S$ and the elements of $Q$ generate $R$, this implies that $u = 1$, q.e.d.

Now let $A_S$ and $U_S$ denote the restrictions to $S$ of $A$ and $U$. By [2, Props. 2.6 and 2.9], $A_S$ is the algebraic group hull of $G_S$. We claim that $U_S$ coincides with the kernel, $V_S$ say, of the semisimple representation associated with the representation of $A_S$ on $S$.




In order to see this, we consider the family of all finite dimensional two-sidedly stable subspaces $T$ of $R$. By the standard decomposition theorem for algebraic linear groups [5, Th. 6.1], the algebraic group $A_T$ is a semidirect product $M_T \cdot V_T$, where $M_T$ is fully reducible. If $T_1 \subset T_2$, the restriction from $T_2$ to $T_1$ is a rational group epimorphism $\rho_{T_2, T_1}$ of $A_{T_2}$ onto $A_{T_1}$. By [5, Prop. 3.2], $\rho_{T_2, T_1}(V_{T_2})$ is unipotent and $\rho_{T_2, T_1}(M_{T_2})$ is fully reducible. Hence $A_{T_1}$ is the semidirect product of these two groups, whence it is clear that $\rho_{T_2, T_1}(V_{T_2}) = V_{T_1}$. Now let $\sigma_{T_2, T_1}$ be the restriction to $V_{T_2}$ of $\rho_{T_2, T_1}$, and consider the inverse system of the rational group epimorphisms $\sigma_{T_2, T_1}$. Evidently, the inverse limit of this system is precisely $U$. Hence we conclude from [2, Prop. 2.8] that $U_T = V_T$, for every $T$, which proves our above claim.

In particular, we have a semidirect decomposition $A_S = M_S \cdot U_S$, where $M_S$ is a fully reducible group of automorphisms of $S$. Now we may identify $GU$ with its image in $A_S$, and then we have $GU = M \cdot U$ (semidirect), where $M = M_S \cap (GU)$. Evidently, $GU$ is a normal subgroup of $A_S$, whence $M$ is a normal subgroup of $M_S$. Hence $M$ is a fully reducible group of automorphisms of $S$. It follows that the action of $M$ on the algebra generated by $S$ is semisimple. Since $S$ and the elements of $Q$ generate $R$, it follows that the action of $M$ on $R$ is semisimple. Thus $M$ is $R$-reductive, in the sense that $R$ is semisimple as an $M$-module.

Now suppose that $G$ is a real analytic group. In this case, we shall define the unipotent hull $U$ of $G$ to be the kernel of the representation on $R_s$ of the group $A_r$ of the real proper automorphisms of $R$, i.e., the proper automorphisms that commute with the complex conjugation. Now we choose the space $S$ used above so as to be stable under the complex conjugation, and we consider the inverse system of the $\sigma_{T_2, T_1}$ obtained by admitting only those subspaces $T$ that are stable under the complex conjugation. Let $(A_T)_r$ be the subgroup of $A_T$ consisting of the elements of $A_T$ that commute with the complex conjugation of $T$. Then $(A_T)_r$ is a real algebraic subgroup of the group of all real linear automorphisms of $T$. Since $A_T$ is the algebraic hull of $G_T \subset (A_T)_r$, it follows that the Lie algebra of $A_T$ is the tensor product extension over $C$ of the Lie algebra of $(A_T)_r$. Now $V_T$ is the analytic subgroup of $A_T$ whose Lie algebra is the set of all nilpotent elements of the radical of the Lie algebra of $A_T$. Hence the Lie algebra of $V_T$ is spanned over $C$ by the set of all nilpotent elements of the radical of the Lie algebra of $(A_T)_r$. Hence $(V_T)_r$ is the kernel of the semisimple representation associated with the representation of $(A_T)_r$ on $T$. By considering the corresponding Lie algebra map, we see that $\sigma_{T_2, T_1}$ maps $(V_{T_2})_r$ onto all of $(V_{T_1})_r$. Now it follows from the same inverse limit argument we used above (replacing [2, Prop. 2.8] with [2, Prop. 2.11]) that $(V_T)_r = U_T$.


We may now continue exactly as in the complex case to conclude that $GU$ is a semidirect product $M \cdot U$, where $M$ is an $R$-reductive subgroup of $A_r$.

From now on, $G$ may again be either complex or real. Since the analytic group $M$ has a faithful finite dimensional semisimple analytic representation, $M$ is a direct product $P \times V$, where $V$ is a vector group and $P$ is a reductive analytic group ([2, Th. 7.2] and [4, Th. 4.1]). Now $V \cdot U$ is a simply connected, solvable, normal, closed analytic subgroup of $GU$, and $GU$ is the semidirect product $P \cdot (V \cdot U)$. Hence $P$ is a maximal reductive analytic subgroup of $GU$.

Now we recall that if $L$ is a linear analytic group, $M$ a maximal fully reducible analytic subgroup of $L$, and $T$ any fully reducible analytic subgroup of $L$, then there is an element $t$ of the radical of $L'$ such that $t T t^{-1} \subset M$ (see [5, Th. 4.1]).

Let $K$ be a nucleus of $G$, and write $G = H \cdot K$, with $H$ reductive. Since $(GU)_S$ lies in the algebraic hull of $G_S$, we have $(GU)' = G'$. Hence we may conclude from the general theorem just quoted that there is an element $t \in N$ such that $t H t^{-1} \subset P$. Since $G = t H t^{-1} \cdot K$, it follows that $G \subset MK$.

Let $x \in M$ and $y \in K$. Then $x y x^{-1} y^{-1} \in (GU)' = G'$. On the other hand, $x y x^{-1} y^{-1}$ is contained in the radical of $G$. Since there is a continuous arc of such commutators joining $x y x^{-1} y^{-1}$ to 1, we conclude that $x y x^{-1} y^{-1}$ lies in the connected component of the identity of the intersection of $G'$ with the radical of $G$, and thus lies in the radical $N$ of $G'$. Since $N \subset K$, we have therefore $x y x^{-1} \in K$. In particular, we conclude that $M \cap K$ is a normal subgroup of $M$. Hence $M \cap K$ is $R$-reductive. Since $\mathrm{Hom}(G, C)$ separates the elements of $K/N$, while the representation of $G$ on $C + \mathrm{Hom}(G, C)$ is unipotent, this implies that $M \cap K \subset N$. Thus $M \cap K \subset M \cap U$, so that $M \cap K = (1)$. Hence $MK$ is the semidirect product $M \cdot K$.

We have obtained the following result. 


THEOREM 5.1. Let G be a real or complex analytic group, and let U be 
the unipotent hull of G. Then GU is an analytic group having a faithful 
finite dimensional analytic representation, and U is a nilpotent, simply con- 
nected, normal, closed analytic subgroup of GU. There is an R-reductive 
closed analytic subgroup M of GU such that GU is the semidirect product 
M-U. If K is any nucleus of G then G is contained in the semidirect product 
M-K. 


COROLLARY 5.1. The dimension of the unipotent hull $U$ is equal to the dimension of any nucleus $K$.


Proof. Since $U' \subset G' \cap U = N$, it is clear that $U/N$ is a vector group. We show first that $\mathrm{Hom}(U/N, C)$ is isomorphic with $\mathrm{Hom}(G, C)$. In doing this, we shall identify $\mathrm{Hom}(U/N, C)$ with the subgroup $\mathrm{Hom}(U, C)^N$ of $\mathrm{Hom}(U, C)$. Let $h \in \mathrm{Hom}(U, C)^N$. Since $[M, U] \subset N$, we can extend $h$ uniquely to a homomorphism of $M \cdot U$ into $C$ that is trivial on $M$. The restriction to $G$ of this homomorphism is an element $h' \in \mathrm{Hom}(G, C)$. Clearly, the map $h \to h'$ is a linear homomorphism of $\mathrm{Hom}(U, C)^N$ into $\mathrm{Hom}(G, C)$. Conversely, given $f \in \mathrm{Hom}(G, C)$, define the map $f^* \colon U \to C$ by $f^*(u) = u(f)(1)$, for every $u \in U$. Since the representation of $G$ on $(C + \mathrm{Hom}(G, C))/C$ is trivial, the same is true for the representation of $U$, because the image of $U$ lies in the algebraic group hull of the image of $G$. From this, it is easily seen that $f^*$ is a homomorphism, and hence that $f^* \in \mathrm{Hom}(U, C)^N$. Moreover, it is verified directly that $(f^*)' = f$ and $(h')^* = h$. Hence $\mathrm{Hom}(U/N, C)$ is isomorphic with $\mathrm{Hom}(G, C)$. Hence we have

$$\dim(U/N) = \dim(\mathrm{Hom}(U/N, C)) = \dim(\mathrm{Hom}(G, C)) = \dim(\mathrm{Hom}(K/N, C)) = \dim(K/N).$$

Hence $\dim(U) = \dim(K)$, q.e.d.

Put $H = M \cap G$. Then, since $G \subset M \cdot K$, we have $G = H \cdot K$. Furthermore, $M' \subset G$, so that $M' \subset H$, i.e., $M/H$ is abelian.

Now consider the natural representation of $M/H$ on $R^H$. Since $M$ is $R$-reductive, this is semisimple. It follows that $R^H$ is spanned by the elements $f \in R^H$ such that, for every $x \in M$, $x(f) = \phi(x) f$, with $\phi(x) \in C$. Clearly, $\phi \in \mathrm{Hom}(M, C^*)$, where $C^*$ is the multiplicative group of the non-zero complex numbers. Now $\phi$ may be regarded as an element of $\mathrm{Hom}(GU, C^*)$ that is trivial on $HU$. The restriction to $G$ is trivial on $HN$ and therefore is an element $q \in Q$. Obviously, $\phi$ coincides with $q$ on $GU$, and so on $M$, i.e., $\phi(x) = x(q)(1)$, for every $x \in M$. Hence $f q^{-1} \in R^M$, and we have shown that $R^H \subset R^M[Q]$.

We denote by $f \to f'$ the involution of $R$ defined by $f'(x) = f(x^{-1})$, for every $x \in G$. The last result is equivalent to $(R^H)' \subset (R^M)'[Q]$, and it is clear that $(R^H)'$ is the subalgebra of $R$ consisting of all elements that are fixed under the right translations with the elements of $H$. By [2, Prop. 2.4] (taking account of the change in notation), we have $(R^H)' R^K = R$. Hence we conclude that $R^K (R^M)'[Q] = R$.

We claim that the elements of $Q$ are free over $(R^M)'$. Let $f_1, \dots, f_n$ be linearly independent elements of $(R^M)'$, and suppose, contrary to our claim, that there are elements $q_1, \dots, q_n$ in $Q$ such that $\sum_{i=1}^{n} f_i q_i = 0$. Then we have $\sum_{i=1}^{n} f_i' q_i^{-1} = 0$. Hence, for all $u \in U$ and $m \in M$,

$$\sum_{i=1}^{n} um(f_i')\, um(q_i^{-1}) = 0, \quad \text{i.e.,} \quad \sum_{i=1}^{n} u(f_i')\, m(q_i^{-1}) = 0.$$

Since the elements of $U$ separate the $f_i'$ (for $G \subset UM$, and the elements of $M$ leave the $f_i'$ fixed), we can form linear combinations of the above relations, with varying $u$ and fixed $m$, so as to get $m(q_i^{-1}) = 0$, for all $m \in M$, and each $i$. But this is impossible, because $M$ separates the elements of $Q$. This proves our above claim.

Evidently, $(R^H)'$ contains $Q$ and $(R^M)'$. Since $(R^H)' \subset (R^M)'[Q]$, we have therefore $(R^H)' = (R^M)'[Q]$. Now $R$ is canonically isomorphic with the tensor product of $R^K$ and $(R^H)'$. Hence we may now conclude that the elements of $Q$ are free over $R^K (R^M)'$. Hence $R^K (R^M)'$ is a basic subalgebra of $R$. Evidently, $R^K$ and $(R^M)'$ are stable under the left translations.

We have seen in Section 3 that $R_s = D_s[Q]$, where $D$ is any left stable basic subalgebra of $R$. By Theorem 3.1, we may take $D$ so that $D_s = R^K$. Hence $R_s = R^K[Q]$. Hence $(R^K (R^M)')_s \subset R^K[Q]$. But this implies that $(R^K (R^M)')_s = R^K$. Hence we have the following result.


THEOREM 5.2. Let $G$, $M$, $K$ be as in Theorem 5.1. Then $R^K (R^M)'$ is a normal basic subalgebra of $R$, and $K$ is the associated nucleus of $G$.


It is clear from Theorem 4.1 that every normal basic subalgebra of $R$ has the form of Theorem 5.2; right translation by $x \in N$ changes $M$ to $x^{-1} M x$.


6. The group of the proper automorphisms. Let $A$ denote the group of all proper automorphisms of $R$, let $K$ be a nucleus of $G$, and let $W$ denote the subgroup of $A$ consisting of all elements of $A$ that leave the elements of $R^K (R^M)'$ fixed. In the complex case, let $P$ stand for the natural image of $G$ in $A$. In the real case, let $P$ stand for the natural image of $G^+$ in $A$. Then $P$ is a closed normal subgroup of $A$. In fact, by [8, Th. 5.1], $P$ is the group of all perfect automorphisms of $R$, i.e., the proper automorphisms $\alpha$ such that $\exp(\alpha(f)) = \alpha(\exp(f))$, for every $f \in \mathrm{Hom}(G, C)$. We claim that $A$ is the semidirect product $W \cdot P$. Since $R^K (R^M)'[Q] = R$ and $\mathrm{Hom}(G, C) \subset R^K (R^M)'$, it is clear that $P \cap W = (1)$. Hence it suffices to show that $A = WP$. Let $\phi \in \mathrm{Hom}(Q, C^*)$. Since the elements of $Q$ are free over $R^K (R^M)'$ and since $R^K (R^M)'$ is stable under the right translations, there is one and only one element $\alpha_\phi \in W$ such that $\alpha_\phi(q) = \phi(q)\, q$, for every $q \in Q$. Clearly, the map $\phi \to \alpha_\phi$ is an isomorphism of $\mathrm{Hom}(Q, C^*)$ onto $W$.

Now let $\beta$ be an arbitrary element of $A$. Let $\beta'$ denote the homomorphism of $R$ into $C$ that is given by $\beta'(r) = \beta(r)(1)$, for every $r \in R$. For every $q \in Q$, there is one and only one $f \in \mathrm{Hom}(G, C)$ such that $q = \exp(f)$. Hence there is an element $\phi \in \mathrm{Hom}(Q, C^*)$ such that

$$\phi(\exp(f)) = \exp(\beta'(f))\, [\beta'(\exp(f))]^{-1},$$

for every $f \in \mathrm{Hom}(G, C)$. One verifies directly that the automorphism $\alpha_\phi \circ \beta$ of $R$ is a perfect automorphism, so that $\alpha_\phi \circ \beta \in P$. Hence $\beta \in WP$, so that $A = WP$.

We observe also that the elements of $W$ commute with the elements of $M$, and hence with the elements of the maximal reductive analytic subgroup $H = M \cap G$ of $G$. In the real case, we note that $H^+$ is a maximal reductive analytic subgroup of $G^+$; $G^+ = H^+ \cdot K^+$, and the elements of $W$ still commute with the elements of $H^+$. Hence we may state our result as follows.


THEOREM 6.1. The group $A$ of the proper automorphisms of $R$ is a semidirect product $W \cdot P$, where $P$ is the natural image of $G$ (in the complex case) or of $G^+$ (in the real case) in $A$, and $W$ is isomorphic, via restriction to $Q$, with $\mathrm{Hom}(Q, C^*)$. Moreover, the elements of $W$ commute with the elements of some maximal reductive analytic subgroup of $P$.


Let $S$ be a finite dimensional two-sidedly stable subspace of $R$ such that the natural representation $\sigma$ of $P$ on $S$ is faithful. Let $E(S)$ denote the algebra of all linear endomorphisms of $S$. We know that the natural image $A_S$ of $A$ in $E(S)$ is the algebraic group hull of $P_S = \sigma(P)$. Let $\mathfrak{P}$ denote the Lie algebra of $P$. We may identify $\mathfrak{P}$ with its image $\sigma^{\circ}(\mathfrak{P})$ in $E(S)$, where $\sigma^{\circ}$ is the differential of $\sigma$. Now let $\alpha$ denote the adjoint representation of $P$ in $E(\mathfrak{P})$. If we identify $\mathfrak{P}$ with $\sigma^{\circ}(\mathfrak{P})$ then, for every $p \in P$, $\alpha(p)$ becomes identified with the conjugation $\xi \to \sigma(p)\, \xi\, \sigma(p)^{-1}$ in the algebra $E(\sigma^{\circ}(\mathfrak{P}))$ of all linear endomorphisms of $\sigma^{\circ}(\mathfrak{P})$. Since $A_S$ is the algebraic group hull of $\sigma(P)$, $\sigma^{\circ}(\mathfrak{P})$ is stable also under the conjugations with the elements of $A_S$, and the corresponding image of $A_S$ in $E(\sigma^{\circ}(\mathfrak{P}))$ is the algebraic group hull of $\alpha(P)$. Transferred back to $E(\mathfrak{P})$, this means the following: the conjugation of $P$ effected by an element of $A$ is an analytic automorphism of $P$ and determines a Lie algebra automorphism of $\mathfrak{P}$. We shall call the resulting representation of $A$ in $E(\mathfrak{P})$ the adjoint representation of $A$ on $\mathfrak{P}$. Our result is that this sends $A$ onto the algebraic group hull of the adjoint group of $P$, i.e., the adjoint representation of $A$ on the Lie algebra of $P$ sends $A$ onto the algebraic group hull of the adjoint group of $P$.

It is an immediate corollary that if $Z(P)$ is the centralizer of $P$ in $A$ then $A = Z(P)P$ if and only if the adjoint group of $P$ is algebraic.




However, the following example shows that, even when $A = Z(P)P$,
P need not be a direct factor in A. Let G be the group of all pairs (a,b)
of complex numbers, with the multiplication

$$(a,b)(a',b') = (a+a',\; b+\exp(a)b').$$

A right stable basic subalgebra B of R is generated by the constants and
the two functions $u_1$, $u_2$, where

$$u_1(a,b) = b, \quad \text{and} \quad u_2(a,b) = a.$$

The translates of these functions are as follows:

$$u_1 \cdot (a,b) = b + \exp(a)u_1; \qquad (a,b) \cdot u_1 = u_1 + b\exp(u_2);$$

$$u_2 \cdot (a,b) = a + u_2 = (a,b) \cdot u_2.$$


In this case, a subgroup W of A as in Theorem 6.1 can be described
explicitly as follows: for every $\gamma \in \mathrm{Hom}(C,C^*)$, there is a $\gamma^* \in W$ defined by:

$$\gamma^*(b) = b, \text{ for every } b \in B, \text{ and}$$
$$\gamma^*(\exp(cu_2)) = \gamma(c)\exp(cu_2), \text{ for every } c \in C.$$

The functions $\exp(cu_2)$ make up the group $Q = \exp(\mathrm{Hom}(G,C))$. Let
t(a,b) denote the left translation by (a,b) on R. Then one verifies easily
that

$$\gamma^*\, t(a,b)\, (\gamma^*)^{-1} = t(a,\gamma(1)b).$$

On the other hand,

$$(a',b')(a,b)(a',b')^{-1} = (a,\; b' + \exp(a')b - \exp(a)b').$$

Hence the conjugation with $\gamma^*$ on P is the conjugation with $t(a',b')$ if and
only if, for all a and b,

$$\gamma(1)b = (1-\exp(a))b' + \exp(a')b,$$

i.e., if and only if $b' = 0$ and $\exp(a') = \gamma(1)$.
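
The conjugation formula in this example can be checked numerically. The following is a minimal sketch, not part of the original argument: the helper names `mul`, `inv`, `conj` and `close`, and the sample points, are assumptions made only for illustration. It implements the multiplication $(a,b)(a',b') = (a+a', b+\exp(a)b')$ on pairs of complex numbers and compares $(a',b')(a,b)(a',b')^{-1}$ with $(a, b' + \exp(a')b - \exp(a)b')$.

```python
import cmath

def mul(x, y):
    """Group law on pairs: (a, b)(a', b') = (a + a', b + exp(a) b')."""
    a, b = x
    a2, b2 = y
    return (a + a2, b + cmath.exp(a) * b2)

def inv(x):
    """Inverse under the law above: (a, b)^{-1} = (-a, -exp(-a) b)."""
    a, b = x
    return (-a, -cmath.exp(-a) * b)

def conj(g, x):
    """Conjugate g x g^{-1}."""
    return mul(mul(g, x), inv(g))

def close(x, y, tol=1e-9):
    """Componentwise comparison up to floating-point error."""
    return abs(x[0] - y[0]) < tol and abs(x[1] - y[1]) < tol

# Arbitrary sample points (illustrative only).
g = (0.3 + 0.2j, -1.1 + 0.5j)   # plays the role of (a', b')
x = (0.7 - 0.4j, 2.0 + 1.0j)    # plays the role of (a, b)
a2, b2 = g
a, b = x

# Closed form stated in the text.
expected = (a, b2 + cmath.exp(a2) * b - cmath.exp(a) * b2)
formula_ok = close(conj(g, x), expected)

# With b' = 0 the conjugation only rescales b by exp(a'),
# matching the condition exp(a') = gamma(1) in the text.
scaling_ok = close(conj((a2, 0j), x), (a, cmath.exp(a2) * b))
```

With $b' = 0$ the second component becomes $\exp(a')b$, which is exactly the rescaling $b \to \gamma(1)b$ produced by $\gamma^*$ when $\exp(a') = \gamma(1)$.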

In particular, we conclude that $A = Z(P)P$. We shall see, however,
that P is not a direct factor in A. Indeed, suppose that P is a direct factor
in A. Then there is a homomorphism $\varphi\colon W \to P$ such that $\gamma^*\varphi(\gamma^*)^{-1} \in Z(P)$
for every $\gamma \in \mathrm{Hom}(C,C^*)$. Putting $\varphi(\gamma^*) = t(a_\gamma, b_\gamma)$, we see from the above
that we must have $b_\gamma = 0$ and $\exp(a_\gamma) = \gamma(1)$. The map $\gamma \to a_\gamma$ is therefore
a homomorphism $\sigma\colon \mathrm{Hom}(C,C^*) \to C$ such that $\exp(\sigma(\gamma)) = \gamma(1)$, for every
$\gamma \in \mathrm{Hom}(C,C^*)$.

For each $a \in C$, define the element $\exp_a$ of $\mathrm{Hom}(C,C^*)$ by $\exp_a(c)
= \exp(ac)$, for every $c \in C$. The map $a \to \sigma(\exp_a)$ is an endomorphism $\eta$
of C. Since $\exp(\eta(a)) = \exp(a)$, we conclude that $\eta(a) - a$ is an integral
multiple of $2\pi i$, for every $a \in C$. Evidently, this implies that $\eta(a) = a$, for
every $a \in C$, i.e., $\sigma(\exp_a) = a$. It follows from this that $\mathrm{Hom}(C,C^*)$ is the
direct product of the subgroup consisting of the $\exp_a$ and the kernel, H say, of $\sigma$.

If n is any positive integer then, for every $\gamma \in \mathrm{Hom}(C,C^*)$, there is one
and only one $\gamma_n \in \mathrm{Hom}(C,C^*)$ such that $(\gamma_n)^n = \gamma$; in fact, $\gamma_n(c) = \gamma(c/n)$,
for every $c \in C$. It follows that if $h \in H$ then also $h_n \in H$. Since $h(1) = 1$,
for every $h \in H$, we conclude that $h(q) = 1$, for every rational number q.
Hence every $\gamma \in \mathrm{Hom}(C,C^*)$ coincides on the group $\mathbb{Q}$ of the rational numbers
with an $\exp_a$. But this is a contradiction. For instance, write $C = \mathbb{Q} + D$,
where D is a $\mathbb{Q}$-subspace of C such that $\mathbb{Q} \cap D = (0)$. Define the element $\gamma$
of $\mathrm{Hom}(C,C^*)$ as follows: $\gamma(d) = 1$, for every $d \in D$; $\gamma(q) = 1$, for every
rational number q that can be written with an odd denominator; $\gamma(q)
= \exp(2\pi i q)$, whenever q can be written with a power of 2 as denominator.
Since this $\gamma$ does not coincide with an $\exp_a$ on $\mathbb{Q}$, we have reached a
contradiction. Thus P is not a direct factor of A.
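
The character $\gamma$ used in this contradiction can be made concrete on the rationals. The sketch below is an illustration under stated assumptions, not part of the original proof: the function names `dyadic_part` and `gamma` are hypothetical, and only the restriction of $\gamma$ to the rational numbers is modelled. Every rational q splits as a part with odd denominator plus a part $x/2^k$, unique modulo integers, and $\gamma(q) = \exp(2\pi i\, x/2^k)$.

```python
from fractions import Fraction
import cmath

def dyadic_part(q):
    """Return x / 2**k (0 <= x < 2**k) such that q - x / 2**k has odd denominator."""
    q = Fraction(q)
    n = q.denominator
    k = 0
    while n % 2 == 0:          # split the denominator as 2**k * n with n odd
        n //= 2
        k += 1
    if k == 0:
        return Fraction(0)
    two_k = 2 ** k
    # q = m / (2**k * n); we need x with x * n congruent to m modulo 2**k.
    x = (q.numerator * pow(n, -1, two_k)) % two_k   # modular inverse: Python 3.8+
    return Fraction(x, two_k)

def gamma(q):
    """Restriction to the rationals of the character described in the text."""
    return cmath.exp(2j * cmath.pi * float(dyadic_part(q)))

# gamma is trivial on rationals with odd denominator ...
value_third = gamma(Fraction(1, 3))
# ... but not on 1/2, where it takes the value exp(pi i).
value_half = gamma(Fraction(1, 2))
```

If $\gamma$ agreed with some $\exp_a$ on the rationals, then $\gamma(1) = 1$ would force $a = 2\pi i k$ with k an integer, and $\gamma(1/3^j) = 1$ for all j would force k to be divisible by every power of 3, hence $k = 0$; but then $\gamma(1/2)$ would equal 1 instead of $\exp(\pi i)$.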


7. Representations as a closed subgroup of a full linear group. The
existence of a right stable basic subalgebra of R leads to a simple proof of
the following result which extends (to the complex case) and sharpens a
result due to Goto [1, Th. 9].


THEOREM 7.1. Let G be a real or complex analytic group, and let $\rho$
be an analytic representation of G with finite dimensional representation
space V. Then there is a finite dimensional analytic representation $\sigma$ with
representation space W such that $V \subset W$ (as a G-module) and $\sigma(G)$ is
closed in the group of all linear automorphisms of W.


Proof. Let B denote a right stable basic subalgebra of R. We can find
a finite dimensional subspace S of R satisfying the following conditions:

(1) S is two-sidedly stable and, in the real case, S is stable under the
    complex conjugation;
(2) the elements of S, together with the constants, generate a subalgebra
    of R containing B;
(3) S contains Hom(G,C) and the representative functions associated
    with the given representation $\rho$;
(4) if $s \in S$ and $s = \sum_{q \in Q} b_q(s)\,q$ with $b_q(s) \in B$, then every $q \in Q$ for
    which $b_q(s) \neq 0$ belongs to S.

In fact, we can evidently find a finite dimensional subspace $S_1$ of R
satisfying conditions (1), (2) and (3). Let $S_2$ be the space of the C-linear
combinations of the elements of Q that occur with non-zero coefficients in the
expressions for the elements of $S_1$ as B-linear combinations of elements of Q.
Since $S_1$ is finite dimensional, so is $S_2$, and if $S = S_1 + S_2$ then S satisfies
all the above conditions.

Now consider the representation, $\varphi$ say, of G by left translations on S.
We claim that $\varphi(G)$ is closed in the group of all linear automorphisms of S.
Let A denote the group of all proper automorphisms of R, and let $A_S$ be its
natural image in the group of all linear automorphisms of S. Since $A_S$ is
the algebraic group hull of $\varphi(G)$, it is closed in the full linear group. Hence
it suffices to show that $\varphi(G)$ is closed in $A_S$. We define a closed subgroup T
of $A_S$ as follows: in the complex case, T consists of all $\beta \in A_S$ such that
$\beta(\exp(f)) = \exp(\beta(f))$, whenever $f \in \mathrm{Hom}(G,C)$ and $\exp(f) \in S$; in the
real case, T consists of all the elements $\beta \in A_S$ satisfying this condition and
commuting with the complex conjugation. Now let $\alpha$ be an element of A
whose restriction to S belongs to T. We can define an algebra endomorphism
$\alpha^*$ of R such that $\alpha^*$ coincides with $\alpha$ on B, while $\alpha^*(\exp(f)) = \exp(\alpha(f))$,
for every $f \in \mathrm{Hom}(G,C)$. Since B is stable under the right translations, it is
clear that $\alpha^*$ commutes with the right translations. Hence [2, Prop. 2.5]
$\alpha^*$ is a proper automorphism of R, and hence a perfect automorphism of R.
Moreover, in the real case, $\alpha^*$ evidently commutes with the complex
conjugation, so that $\alpha^*$ is a real perfect automorphism. Clearly, because of (4),
$\alpha^*$ coincides with $\alpha$ on S. Hence we conclude that, in the complex case,
$T = \varphi(G)$. In the real case, we conclude that T is the restriction image of
the group $P_r$ of all real perfect automorphisms. It is clear from [3, Th. 5.1]
that $\varphi(G)$ is the connected component of the identity in $(P_r)_S$. Thus, in
either case, $\varphi(G)$ is closed in T, and therefore also in the full linear group.

Finally, if n is the dimension of V over C, V may be identified with a
G-submodule of the direct sum of n copies of S, by condition (3) and
[2, Prop. 2.3]. If we take W to be the direct sum of n copies of S and let
$\sigma$ be the representation of G on W obtained from $\varphi$ in the natural fashion
then $\sigma$ evidently satisfies the requirements of Theorem 7.1.


8. Nilpotent nuclei and algebraic structures. 


THEOREM 8.1. Let G be a real or complex analytic group, B a normal
basic subalgebra of R, K the nucleus of G that is associated with B. Then B
is two-sidedly stable if and only if K is nilpotent. Moreover, in that case,
B coincides with the algebra of all representative functions associated with
representations that are unipotent on K.


Proof. Suppose first that B is two-sidedly stable. By definition, K is
the kernel of the representation of G on B;. Let S be any left stable finite
dimensional subspace of B. Since B is two-sidedly stable, all the
representative functions associated with the representation of G on S belong to B.
Hence we may apply the argument of the beginning of Section 5 (used there
for showing that the representation of U on S is unipotent) to conclude that
the representation of K on S is unipotent. Since B is a basic subalgebra of
R, we may choose S so that the representation of K (even of G) on S is
faithful. Hence we conclude that K is nilpotent.

Now suppose that K is nilpotent. Let $B_1$ be the normal basic subalgebra
of R constructed from K as in Section 3. It has been shown in [3, pp. 304-
306] that the nilpotency of K implies that $B_1$ coincides with the algebra of
all representative functions associated with representations that are unipotent
on K. Hence $B_1$ is two-sidedly stable, and it follows from Theorem 4.1 that
$B_1 = B$. This completes the proof of Theorem 8.1.


Remark. Let H be a maximal reductive analytic subgroup of G, and let
U be the unipotent hull of G. Since H is determined up to a conjugation
with an element of N, the subgroups HU and HW of A are independent of
the particular choice of H. Now suppose that G has a nilpotent nucleus K,
and let B be the corresponding two-sidedly stable basic subalgebra of R. We
claim that B uniquely determines an analytic isomorphism of HU onto G
that sends U onto K and leaves the elements of HN fixed. In order to see
this, we consider the natural representations of HU and G by proper
automorphisms of B. Since HU leaves the elements of Q fixed, it is clear that
the representation of HU on B is faithful. On the other hand, we know that
the representation of G on B is faithful, and that K is the kernel of the
representation of G on B;. Hence the image of K in the group of the proper
automorphisms of B must contain the image of U. By Corollary 5.1, we have
dim(K) = dim(U). Hence we conclude that the images of U and of K in
the group of the proper automorphisms of B coincide. Evidently, this suffices
to establish our claim; the isomorphism between HU and G goes via the group
of the proper automorphisms of B.

Observe that this result immediately implies Theorem 8.4 below;
however, we shall give a more direct proof of Theorem 8.4 later on, because this
will lead to an explicit description of the set of all nilpotent nuclei.

Now let G be a complex analytic group, and suppose that G has a faithful
complex analytic representation $\rho$ such that $\rho(G)$ is an algebraic linear
group. Let $B_\rho$ denote the subalgebra of R consisting of all $f \in R$ such that
$f \circ \rho^{-1}$ is a rational function on $\rho(G)$. If $\sigma$ is another such representation of
G then $B_\rho = B_\sigma$ if and only if the representation $\sigma \circ \rho^{-1}\colon \rho(G) \to \sigma(G)$ is a
rational representation of $\rho(G)$. In order to see this, we need only observe
that the representation $\sigma \circ \rho^{-1}$ is rational if and only if its inverse $\rho \circ \sigma^{-1}$ is
rational [2, Lemma 10.2]. We shall call such a subalgebra $B_\rho$ of R an
algebraic structure of G. If S is any finite dimensional left stable subspace
of $B_\rho$ whose elements, together with the constants, generate $B_\rho$ then the
representation of G on S is a faithful representation of G yielding a faithful
rational representation of $\rho(G)$, so that the image is an algebraic group,
rationally isomorphic with $\rho(G)$. By [2, Lemma 10.1], such a subspace exists
in every $B_\rho$. Thus the subalgebras $B_\rho$ correspond in a 1-1 fashion to the
rational isomorphism classes of the faithful representations of G as an
algebraic linear group.

We shall see that the algebraic structures of G are precisely the
two-sidedly stable basic subalgebras of R. This will follow easily from the next
theorem.


THEOREM 8.2. Let G be a complex analytic group, and let $\rho$ be a faithful
complex analytic representation of G such that $\rho(G)$ is an algebraic linear
group. Let K be the kernel of the semisimple representation associated with $\rho$.
Then K is a nilpotent nucleus of G, and a complex analytic representation
$\sigma$ of G yields a rational representation $\sigma \circ \rho^{-1}$ of $\rho(G)$ if and only if $\sigma$ is
unipotent on K.


Proof. Let $\rho'$ denote the semisimple representation associated with $\rho$.
Clearly, $\rho' \circ \rho^{-1}$ is a rational representation of the algebraic group $\rho(G)$. Hence
its kernel, $\rho(K)$, is an algebraic subgroup of $\rho(G)$. Since $\rho(K)$ is unipotent,
this implies that $\rho(K)$ is connected, simply connected, and nilpotent. Hence
K is connected, simply connected, and nilpotent. On the other hand, $\rho'(G)$
is a fully reducible linear algebraic group, and it follows from [4, Th. 4.1]
that $\rho'(G)$ is therefore a reductive complex analytic group. Since G/K is
isomorphic with $\rho'(G)$, it is now clear that K is a nilpotent nucleus of G.

Now write G as a semidirect product $H \cdot K$, with H reductive. Consider
the corresponding decomposition $\rho(G) = \rho(H) \cdot \rho(K)$. Let V be the
representation space of $\rho$, and let $(0) = V_0 \subset \cdots \subset V_n = V$ be a composition
series of V. Since $\rho(H)$ is fully reducible, there is a $\rho(H)$-module
isomorphism $\varphi\colon \sum_{i=1}^{n} V_i/V_{i-1} \to V$. Consider the algebra isomorphism
$\psi\colon e \to \varphi \circ e \circ \varphi^{-1}$ of $E\bigl(\sum_{i=1}^{n} V_i/V_{i-1}\bigr)$ onto E(V). The representation
of G on $\sum_{i=1}^{n} V_i/V_{i-1}$ is the semisimple representation $\rho'$ associated with $\rho$,
and $\rho' \circ \rho^{-1}$ is the rational representation of $\rho(G)$ on $\sum_{i=1}^{n} V_i/V_{i-1}$.
Now $\psi \circ \rho' \circ \rho^{-1}$ is precisely the projection
of $\rho(G)$ onto $\rho(H)$ that corresponds to the decomposition $\rho(G) = \rho(H) \cdot \rho(K)$.
Hence it is clear that this projection is a rational group epimorphism of the
algebraic group $\rho(G)$ onto the algebraic group $\rho(H)$. It follows that the
projection of $\rho(G)$ onto $\rho(K)$ is also a rational map (though not necessarily
a group homomorphism).

Now let $\sigma$ be a complex analytic representation of G. If $\sigma \circ \rho^{-1}$ is a
rational representation of $\rho(G)$ then it is unipotent on $\rho(K)$, by [5, Prop. 3.2].
Thus $\sigma$ is unipotent on K, in that case. Conversely, suppose that $\sigma$ is
unipotent on K, and let f be a representative function associated with $\sigma$. It
follows from the elementary theory of representative functions (see [2, Section
2]) that there are representative functions $u_i$ on H and representative functions
$v_i$ on K such that the $u_i$ are associated with the restriction of $\sigma$ to H, the
$v_i$ are associated with the restriction of $\sigma$ to K, and

$$f(hk) = \sum_{i=1}^{n} u_i(h)v_i(k),$$

for all $h \in H$ and $k \in K$. The functions $u_i \circ \rho^{-1}$ are analytic representative
functions on $\rho(H)$ and, since $\rho(H)$ is reductive, they are rational functions
on $\rho(H)$, by [4, Th. 5.2]. Since $\sigma$ is unipotent on K, the restriction of $\sigma \circ \rho^{-1}$
to $\rho(K)$ is a rational representation of the unipotent algebraic group $\rho(K)$.
Hence the functions $v_i \circ \rho^{-1}$ are rational functions on $\rho(K)$. Since the
projections of $\rho(G)$ onto $\rho(H)$ and $\rho(K)$ are rational maps, we may now conclude
that $f \circ \rho^{-1}$ is a rational function on $\rho(G)$, so that $\sigma \circ \rho^{-1}$ is a rational
representation of $\rho(G)$. This completes the proof of Theorem 8.2.

It is clear from Theorems 8.1 and 8.2 that each algebraic structure $B_\rho$
is a two-sidedly stable basic subalgebra of R, and that the associated nucleus
K is the kernel of the semisimple representation associated with $\rho$.
Conversely, let B be any two-sidedly stable basic subalgebra of R. It is easily seen,
as in our proof of Theorem 6.1, that every proper automorphism of R coincides
on B with a left translation by an element of G. Let S be a finite dimensional
two-sidedly stable subspace of B whose elements, together with the constants,
generate B. Let $\rho$ be the representation of G by left translations on S. Then
$\rho(G) = A_S$ and hence, by [2, Props. 2.6 and 2.9], $\rho(G)$ is an algebraic
subgroup of the group of all linear automorphisms of S. Now $B \subset B_\rho$, and
since $B_\rho$ is a basic subalgebra of R this implies that $B = B_\rho$. Thus we have
the following result.




THEOREM 8.3. Let G be a complex analytic group. Then the algebraic 
structures of G are precisely the two-sidedly stable basic subalgebras of R. 
This gives a 1-1 correspondence between the algebraic structures and the nil- 
potent nuclei of G, as follows: given an algebraic structure on G, the corre- 
sponding nilpotent nucleus is the largest normal subgroup of G on which 
every rational representation of G (belonging to the given algebraic structure) 
is unipotent; given a nilpotent nucleus of G, the rational representations for 
the corresponding algebraic structure are precisely those complex analytic 
representations of G which are unipotent on the given nucleus. 


In view of Theorem 8.3, it is of interest to examine the set of the nil- 
potent nuclei of G. A description of this set is made possible by the fact, 
discovered by B. Kostant, that any two nilpotent nuclei of G are conjugate 
by an analytic automorphism of G (note that the example at the end of 
Section 4 shows that this is false for non-nilpotent nuclei). We observe first 
that it follows from the conjugacy (by inner automorphisms) of any two 
maximal reductive analytic subgroups of an analytic group that, if K is a 
nucleus of G and H is any maximal reductive analytic subgroup of G, then G
is the semidirect product $H \cdot K$.


THEOREM 8.4 (B. Kostant). Let G be a real or complex analytic group,
and suppose that K and L are two nilpotent nuclei of G. Let H be a maximal
reductive analytic subgroup of G, so that $G = H \cdot K = H \cdot L$. Then there is
an analytic automorphism $\alpha$ of G such that $\alpha$ leaves the elements of HN fixed
and $\alpha(K) = L$.

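Proof. Let $\mathfrak{G}$ be the Lie algebra of G, $\mathfrak{M}$ the maximum nilpotent ideal
of $\mathfrak{G}$, $\mathfrak{H}$ the Lie algebra of H, $\mathfrak{K}$ and $\mathfrak{L}$ the Lie algebras of K and L. We have
$\mathfrak{G} = \mathfrak{H} + \mathfrak{K} = \mathfrak{H} + \mathfrak{L}$, the sum being semidirect, and $\mathfrak{K} \subset \mathfrak{M}$, $\mathfrak{L} \subset \mathfrak{M}$. Hence
$\mathfrak{M} = (\mathfrak{H} \cap \mathfrak{M}) + \mathfrak{K} = (\mathfrak{H} \cap \mathfrak{M}) + \mathfrak{L}$.

Under the adjoint representation of G on $\mathfrak{G}$, H operates semisimply on
$\mathfrak{G}$. Let M be the maximum nilpotent normal analytic subgroup of G. Then
$H \cap M$ is normal in H, and hence operates semisimply on $\mathfrak{G}$. On the other
hand, M operates unipotently on $\mathfrak{G}$. Hence we conclude that $H \cap M$ operates
trivially on $\mathfrak{G}$. Hence $H \cap M$ lies in the center of G and $\mathfrak{H} \cap \mathfrak{M}$ lies in the
center of $\mathfrak{G}$.

For $x \in \mathfrak{K}$, write $x = \varphi(x) + \gamma(x)$, with $\varphi(x) \in \mathfrak{H} \cap \mathfrak{M}$ and $\gamma(x) \in \mathfrak{L}$.
Now, if $u \in \mathfrak{G}$, write $u = v + x$, with $v \in \mathfrak{H}$ and $x \in \mathfrak{K}$, and define $f(u)
= v + \gamma(x)$. Since $\gamma$ is a linear isomorphism of $\mathfrak{K}$ onto $\mathfrak{L}$, f is a linear
automorphism of $\mathfrak{G}$. Since $\mathfrak{H} \cap \mathfrak{M}$ is in the center of $\mathfrak{G}$, we have $[x, y] = [\gamma(x), y]$,
for every $x \in \mathfrak{K}$ and every $y \in \mathfrak{G}$, whence $[f(u), y] = [u, y]$, for all u and y in $\mathfrak{G}$.
Hence $[f(u_1), f(u_2)] = [u_1, u_2]$, for all $u_1$, $u_2$ in $\mathfrak{G}$. But $[\mathfrak{G}, \mathfrak{G}] \subset \mathfrak{H} + (\mathfrak{K} \cap \mathfrak{L})$,
because both $\mathfrak{K}$ and $\mathfrak{L}$ must contain the radical of $[\mathfrak{G}, \mathfrak{G}]$. Hence f is the
identity map on $[\mathfrak{G}, \mathfrak{G}]$, so that the above implies that f is a Lie algebra
automorphism of $\mathfrak{G}$.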

Now let $H^*$ be the universal covering group of H. Then the appropriately
defined semidirect product $H^* \cdot K$ (in which $H^*$ operates on K via H) is the
universal covering group of G. Hence our automorphism f of $\mathfrak{G}$ defines an
analytic automorphism $f^*$ of $H^* \cdot K$. Since f leaves the elements of $\mathfrak{H}$ fixed,
$f^*$ leaves the elements of $H^*$ fixed. Hence $f^*$ induces an automorphism $\alpha$
on $G = H \cdot K$ such that $\alpha$ leaves the elements of H fixed and coincides with
$f^*$ on K. Since f maps $\mathfrak{K}$ onto $\mathfrak{L}$ and coincides with the identity map on $\mathfrak{N}$,
it follows that $\alpha(K) = L$ and that $\alpha$ leaves the elements of HN fixed. This
completes the proof.

As an immediate consequence of Theorems 8.3 and 8.4, we obtain the
following result.


THEOREM 8.5. Let S and T be two complex linear irreducible algebraic
groups, and suppose that $\sigma$ is a complex analytic isomorphism of S onto T.
Then there exists a complex analytic automorphism $\alpha$ of S such that $\sigma \circ \alpha$
is a rational isomorphism of S onto T.


Proof. Let $B(S) \subset R(S)$ and $B(T) \subset R(T)$ be the given algebraic
structures of S and T, respectively. Then $B(T) \circ \sigma$ is also an algebraic
structure of the analytic group S. Let L and K be the nilpotent nuclei of S that
correspond to $B(T) \circ \sigma$ and $B(S)$, respectively. There is an analytic
automorphism $\alpha$ of S such that $\alpha(K) = L$. This implies that $B(T) \circ \sigma \circ \alpha = B(S)$,
i.e., that $\sigma \circ \alpha$ is a rational isomorphism of S onto T.

It is known from the theory of Abelian varieties that Theorem 8.5 does 
not extend to general (non-linear) complex algebraic groups. 

Now we proceed to describe the set of all nilpotent nuclei of the real or
complex analytic group G. As before, let N denote the radical of G', and let
M be the maximum nilpotent normal analytic subgroup of G. Let $\mathfrak{N}$ and $\mathfrak{M}$
be the Lie algebras of N and M, respectively. Let H be a maximal reductive
analytic subgroup of G. We have seen in the proof of Theorem 8.4 that $H \cap M$
lies in the center of G. Since the maximal reductive analytic subgroups of G
are conjugate under inner automorphisms, it follows that $H \cap M$ is actually
independent of the choice of H, and thus is a uniquely determined closed
central subgroup P of G. Let $\mathfrak{P}$ denote the Lie algebra of P. We denote
by $\mathrm{Hom}(\mathfrak{M}/(\mathfrak{P} + \mathfrak{N}), \mathfrak{P})$ the space of all linear maps of $\mathfrak{M}$ into $\mathfrak{P}$ that send
$\mathfrak{P} + \mathfrak{N}$ onto (0). Finally, let T denote the radical of G.


THEOREM 8.6. Let G be a real or complex analytic group. Then G has
a nilpotent nucleus if and only if T/M is reductive. In that case, the set
of all nilpotent nuclei of G has the structure of an affine space, with
$\mathrm{Hom}(\mathfrak{M}/(\mathfrak{P} + \mathfrak{N}), \mathfrak{P})$ as the underlying vector group.


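Proof. In the real case, [3, Th. 5.3] implies that there is a two-sidedly
stable basic subalgebra of R if and only if T/M is reductive. In the complex
case, the same result is contained in [4, Th. 5.4]. Hence the first statement
of Theorem 8.6 follows from Theorem 8.1.

Now let K be a nilpotent nucleus of G, and write $G = H \cdot K$, as before.
If $\mathfrak{K}$ is the Lie algebra of K then, as we have seen in the proof of Theorem 8.4,
$\mathfrak{M}$ is the semidirect sum $\mathfrak{P} + \mathfrak{K}$. Now let $\varphi \in \mathrm{Hom}(\mathfrak{M}/(\mathfrak{P} + \mathfrak{N}), \mathfrak{P})$. Let
$\mathfrak{L}$ be the subspace of $\mathfrak{M}$ consisting of the elements $\varphi(x) + x$, where x ranges
over $\mathfrak{K}$. Noting that $\mathfrak{P}$ lies in the center of $\mathfrak{G}$ and that $\varphi$ annihilates $[\mathfrak{G}, \mathfrak{G}]$,
we see that $\mathfrak{L}$ is an ideal of $\mathfrak{G}$. Clearly, $\mathfrak{H} + \mathfrak{L} = \mathfrak{H} + \mathfrak{K} = \mathfrak{G}$, and $\mathfrak{H} \cap \mathfrak{L}
= (0)$. The proof of Theorem 8.4 shows that there is an analytic
automorphism $\alpha$ of G such that $\alpha$ leaves the elements of H fixed and $\alpha(K)$ is the
analytic subgroup L of G whose Lie algebra is $\mathfrak{L}$. Thus L is a nilpotent
nucleus of G. We write $L = \varphi \cdot K$. One checks immediately from the
definition that, if $\psi$ is any other element of $\mathrm{Hom}(\mathfrak{M}/(\mathfrak{P} + \mathfrak{N}), \mathfrak{P})$, then
$\psi \cdot (\varphi \cdot K) = (\psi + \varphi) \cdot K$, and that $0 \cdot K = K$. Moreover, $\varphi \cdot K = K$ implies
that $\varphi = 0$, because $\mathfrak{M} = \mathfrak{P} + \mathfrak{K}$. Finally, it is clear from the proof of Theorem
8.4 (where we introduced the map $\varphi$) that, given K and L, there is an element
$\varphi$ in $\mathrm{Hom}(\mathfrak{M}/(\mathfrak{P} + \mathfrak{N}), \mathfrak{P})$ such that $\varphi \cdot L = K$. This completes the proof
of Theorem 8.6.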

In particular, we note that G has only one nilpotent nucleus if and only
if either $\mathfrak{P} = (0)$ or $\mathfrak{P} + \mathfrak{N} = \mathfrak{M}$. The first alternative means that M is a
nucleus. The second alternative means that N is a nucleus, i.e., that G/G'
is reductive (see [4, Th. 5.2] and [2, Ths. 11.1 and 9.1]).

It is clear from Theorem 8.3 that, if G is a complex analytic group,
Theorem 8.6 describes the algebraic structures of G, simply by reading
'algebraic structure' for 'nilpotent nucleus,' throughout.


REFERENCES. 


[1] M. Goto, "Faithful representations of Lie groups II," Nagoya Mathematical Journal,
vol. 1 (1950), pp. 91-107.

[2] G. Hochschild and G. D. Mostow, "Representations and representative functions of
Lie groups," Annals of Mathematics, vol. 66 (1957), pp. 495-542.

[3] G. Hochschild and G. D. Mostow, "Representations and representative functions of
Lie groups, II," ibid., vol. 68 (1958), pp. 295-313.

[4] G. Hochschild and G. D. Mostow, "Representations and representative functions of
Lie groups, III," ibid., vol. 70 (1959), pp. 85-100.

[5] G. D. Mostow, "Fully reducible subgroups of algebraic groups," American Journal
of Mathematics, vol. 78 (1956), pp. 200-221.


LINEAR GROUPS OVER LOCAL RINGS.*


By WILHELM KLINGENBERG.


1. Results.†


1.1. We consider a (commutative) local ring L. The largest ideal of L
is denoted by I. Then $L^* = L - I$ is a group under multiplication. If J is
an ideal in L, $J \neq L$, then L/J is again a local ring. Let

(1) $g_J\colon L \to L/J$

denote the natural homomorphism of L onto L/J.

By an n-dimensional vector space over L, $V = V_n(L)$, we mean an
L-module isomorphic to $L^n$. By an m-dimensional subspace of V we mean a
submodule U of V that is a direct summand and is isomorphic to $L^m$.

The general linear group in n variables over L, GL(n,L), is defined as
the group of linear automorphisms of $V = V_n(L)$.

Let J be an ideal in L. (1) determines the natural homomorphism

(2) $g_J\colon V_n(L) \to V_n(L/J)$.

Here we also allow J = L; in this case $V_n(L/J)$ shall denote the 0-vector
space. (2) determines the natural homomorphism

(3) $h_J\colon GL(n,L) \to GL(n,L/J)$

with the property $(h_J\sigma)g_J = g_J\sigma$ for all $\sigma \in GL(n,L)$. In the case J = L,
GL(n,L/J) shall be the identity group E. By the order o(X) of a
vector $X \in V_n(L)$ we mean the smallest ideal J with $g_J X = 0$. By
the order $o(\sigma)$ of an element $\sigma \in GL(n,L)$ we mean the smallest
ideal J such that $h_J\sigma \in$ center GL(n,L/J). By the order o(G) of a
subgroup G of GL(n,L) we mean the smallest ideal J such that
$h_J G \subset$ center GL(n,L/J).

* Received September 19, 1960.
† The most important results of this paper were announced in [6].

Let $E_i$, $1 \le i \le n$, be a basis of V. If, for $X \in V$, $X = \sum_i E_i x_i$, then
o(X) is equal to the ideal generated by the $x_i$, $1 \le i \le n$. If, for
$\sigma \in GL(n,L)$, $\sigma E_i = \sum_j E_j a_{ji}$, then $o(\sigma)$ is equal to the ideal generated by
the $a_{ij}$, $i \neq j$, and $a_{ii} - a_{jj}$, $1 \le i,j \le n$. The order o(G) of a subgroup
G of GL(n,L) is generated by the orders $o(\sigma)$, $\sigma \in G$.
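
For the local ring $L = Z/(p^n)$ every ideal has the form $(p^k)$ with $0 \le k \le n$, so the orders described above reduce to a single exponent. The following sketch is an illustration under that assumption only; the function names `valuation`, `order_of_vector` and `order_of_matrix` are hypothetical and do not come from the paper. An exponent k stands for the ideal $(p^k)$, and k = n stands for the 0-ideal.

```python
def valuation(x, p, n):
    """Largest k <= n with p**k dividing x in Z/(p**n); x = 0 gives n."""
    x %= p ** n
    if x == 0:
        return n
    v = 0
    while x % p == 0:
        x //= p
        v += 1
    return v

def order_of_vector(xs, p, n):
    """Exponent k with o(X) = (p**k): the ideal generated by the coordinates."""
    return min(valuation(x, p, n) for x in xs)

def order_of_matrix(a, p, n):
    """Exponent k with o(sigma) = (p**k): the ideal generated by the
    off-diagonal entries a_ij (i != j) and the differences a_ii - a_jj."""
    size = len(a)
    gens = [a[i][j] for i in range(size) for j in range(size) if i != j]
    gens += [a[i][i] - a[j][j] for i in range(size) for j in range(i + 1, size)]
    return min(valuation(g, p, n) for g in gens)

# Examples over Z/(27), i.e. p = 3, n = 3.
vector_order = order_of_vector([9, 18, 0], 3, 3)            # ideal (9)
transvection_order = order_of_matrix([[1, 3], [0, 1]], 3, 3)  # ideal (3)
scalar_order = order_of_matrix([[4, 0], [0, 4]], 3, 3)        # 0-ideal
```

A scalar matrix lies in the center, so its order is the 0-ideal, while the elementary transvection with off-diagonal entry 3 has order (3).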


1.2. Let J be an ideal in L. By the general congruence subgroup
mod J of GL(n,L), GC(n,L,J), we mean the group

(4) $GC(n,L,J) = h_J^{-1}(\text{center } GL(n,L/J))$.

Clearly GC(n,L,J) is a normal subgroup of GL(n,L) of
order J. In particular we have GC(n,L,L) = GL(n,L) and GC(n,L,0)
= center GL(n,L), isomorphic to $L^*$. Here 0 denotes the 0-ideal in L.

Let J be an ideal in L. By the special congruence subgroup mod J
of GL(n,L), SC(n,L,J), we mean the normal subgroup
of GL(n,L) generated by the transvections of order $\subset J$,
i.e., by the transvections in GC(n,L,J). Here a transvection $\tau$ is
an element of GL(n,L) for which there is a subspace H of codimension 1
(briefly: hyperplane) in V such that $\tau|H$ = identity and such that $\tau X - X \in H$
for all $X \in V$.

Clearly SC(n,L,J) is a normal subgroup of GL(n,L) with
an order $\subset J$, which is contained in GC(n,L,J). In particular
SC(n,L,0) = E = identity group. For SC(n,L,L) we also write
SL(n,L) and call this group the special linear group in n variables
over L.

1.3. LEMMA. Two vectors A and B of $V = V_n(L)$ have the same order
o(A) = o(B) if and only if there is an element $\sigma \in GL(n,L)$
such that $\sigma A = B$.

From this follows

PROPOSITION 1. Two transvections $\tau_1$, $\tau_2$ in GL(n,L) of the same order
$o(\tau_1) = o(\tau_2)$ are conjugate in GL(n,L).

SUPPLEMENT. If $n \ge 3$ and $o(\tau_1) = o(\tau_2)$ is a principal ideal, then
$\tau_1$ and $\tau_2$ are already conjugate in SL(n,L).


1.4. With this, the special congruence subgroups can be characterized
as follows:

THEOREM 1. Let J be an ideal in the local ring L and let G be a
subgroup of GL(n,L). The following statements are equivalent:
(a) G = SC(n,L,J).
(b) G consists of the elements $\sigma \in GL(n,L)$ with $\det \sigma = 1$ and
    $h_J\sigma$ = identity.
(c) G = mixed commutator group Komm(GL(n,L), GC(n,L,J)).
    For n = 2 it is assumed here that $L/I \neq F_2$.

COROLLARY. Let G be a subgroup of GL(n,L). The following statements
are equivalent:
(a) G = SL(n,L).
(b) G consists of the elements $\sigma \in GL(n,L)$ with $\det \sigma = 1$.
(c) G is the commutator group of GL(n,L). Here for n = 2 it is
    assumed that $L/I \neq F_2$.

THEOREM 2. Let J be an ideal in the local ring L.
GC(n,L,J)/SC(n,L,J) is isomorphic to the subgroup U(n,L,J)
of $L^* \times (L/J)^*$ formed by the elements $(a,b)
\in L^* \times (L/J)^*$ with $g_J a = b^n$.
Set $HC(n,L,J) = GC(n,L,J) \cap SL(n,L)$. Then
HC(n,L,J)/SC(n,L,J)
is isomorphic to the group $E_n((L/J)^*)$ of the n-th roots of unity in
$(L/J)^*$.

COROLLARY. GL(n,L)/SL(n,L) is isomorphic to $L^*$, center GL(n,L)
is isomorphic to $L^*$. Center SL(n,L) is isomorphic to $E_n(L^*)$.
1.6. GC(n,L,J) and SC(n,L,J) are normal subgroups of
order J. By Theorem 2, GC(n,L,J)/SC(n,L,J) is commutative, so
every subgroup G of GL(n,L) that satisfies the relation

(5) $GC(n,L,J) \supset G \supset SC(n,L,J)$

is a normal subgroup of order $o(G) \subset J$.


The main result of the present paper is that, conversely, every
normal subgroup G of order o(G) = J satisfies the relation (5).
We shall even prove, more generally, that this holds for the subgroups G
of GL(n,L) that are invariant under SL(n,L).

First we prove


PROPOSITION 2. Let $(\tau_\alpha)$, $\alpha \in A$, be a set of transvections $\tau_\alpha$ of order
$o(\tau_\alpha) = J_\alpha$. The subgroup G of GL(n,L) generated by the $\tau_\alpha$ and
invariant under SL(n,L) is equal to SC(n,L,J), where J is the ideal in L
generated by the $J_\alpha$, $\alpha \in A$. For n = 2 we assume: $\mathrm{char}(L/I) \neq 2$.


Then we prove the fundamental


PROPOSITION 3. Let G be a subgroup of GL(n,L), invariant under SL(n,L),
of order o(G) = J. Then G contains the group SC(n,L,J). Here
for n = 2 we assume: $\mathrm{char}(L/I) \neq 2$ and $L/I \neq F_3$.

1.7. Combining the preceding results, we obtain
the following theorem on the structure of the general and the special
linear group over a local ring L:


THEOREM 3. Let L be a local ring.


(i) A subgroup G of GL(n,L) that is invariant under SL(n,L)
    determines an ideal J of L such that (5) holds. Conversely, every
    subgroup G of GL(n,L) that satisfies the relations (5) is
    a normal subgroup of GL(n,L) of order o(G) = J.

(ii) A normal subgroup G of SL(n,L) determines an ideal
    J of L such that


(6) $HC(n,L,J) = GC(n,L,J) \cap SL(n,L) \supset G \supset SC(n,L,J)$


    holds. Conversely, every subgroup G of SL(n,L) that satisfies the
    relations (6) is normal and of order J.


For n = 2 we assume: $\mathrm{char}(L/I) \neq 2$ and $L/I \neq F_3$.


(Dieudonné [4], [5]) Let L be a commutative field.


(i) For a subgroup G of GL(n,L) that is invariant under
    SL(n,L), one of the following relations holds:

(7) $GL(n,L) \supset G \supset SL(n,L)$,
    center $GL(n,L) \supset G \supset E$.

    Conversely, every subgroup G of GL(n,L) that satisfies one of the
    relations (7) is normal in GL(n,L).
(ii) The normal subgroups G of SL(n,L), $G \neq SL(n,L)$,
    belong to (center GL(n,L)) $\cap$ SL(n,L) = center SL(n,L).
Here for n = 2 we assume: $\mathrm{char}\, L \neq 2$ and $L \neq F_3$.


1.8. On the basis of Theorems 2 and 3 we can state the following:

(a) The subgroups G of GL(n,L) of order J that are invariant under
SL(n,L) satisfy the conditions (5), that is, GC(n,L,J) and
SC(n,L,J) are the largest and the smallest subgroup, respectively, of order J
in GL(n,L) that is invariant under SL(n,L), and every group G lying between
these two groups is normal in GL(n,L) and of order J.


(b) Every subgroup G of GL(n,L) that is invariant under SL(n,L) (and
which is then also invariant under GL(n,L)) is determined by its order
o(G) = J and by the group G/SC(n,L,J), which is a subgroup of the
commutative group U(n,L,J) introduced in Theorem 2.


The corresponding statements hold for the normal subgroups of SL(n,L).


1.9. For the case $L = Z/(p^m)$, p a prime, the preceding
results were proved by Brenner [2].

For the case that L is a generalized valuation ring, that is,
a commutative ring with identity whose ideals are totally ordered, the
preceding results were proved in [7].

As in [7], we follow, in the notation and in the arrangement of the
proofs, the presentation of the theory of linear groups given by
Artin [1].


1.10. The preceding results remain essentially valid if one
considers linear groups GL(n, L) over noncommutative local rings L.
A noncommutative local ring is, following Cartan-Eilenberg
[3], a noncommutative ring L with identity and a largest ideal I;
I is a two-sided ideal; L/I is a not necessarily commutative field.
We shall further assume that every left ideal J of L is also a right ideal.
The class of these rings includes, for I = 0, the noncommutative
fields.

The proofs given below remain valid, at least for $n \ge 3$, also
for the linear groups GL(n, L) over such a noncommutative
local ring; only in the proof of the relation


$SC(n, L, J) = \mathrm{Komm}(GL(n, L), GC(n, L, J))$,


Theorem 1(c), are some additional considerations needed, which are
connected with the fact that the notion of determinant in Theorem 1(b)
must be refined. Namely, the ordinary determinant is replaced by
a J-determinant defined for each ideal J of L:


$\det_J : GC(n, L, J) \to L(J, n)$


where  L(J,n) (L*, Zentrum (L/J)*)
is a subgroup of L*/Komm(L*). For J = L, $\det_J$ coincides
with the determinant over noncommutative fields introduced by
Dieudonné [4], [5].

In Theorem 2 one has to read correspondingly: GC(n, L, J)/SC(n, L, J) is
isomorphic to the subgroup of the pairs (a, b) in $L(J, n) \times \text{Center}(L/J)^*$
with $g_J a = b^n$. In particular, for J = L and J = 0 this gives: GL(n, L)/SL(n, L)
is isomorphic to L*/Komm L*, and Center GL(n, L) is isomorphic to Center L*.

As a corollary one obtains the theorems of Dieudonné [4], [5] on the
structure of the linear groups over noncommutative fields.

The complete proofs of this generalization will be published elsewhere.


2. Proofs.


2.1. Proof of the Lemma. If there is a $\sigma \in GL(n, L)$ with $\sigma A = B$,
then $g_J A = 0$ is equivalent to $g_J B = g_J \sigma A = h_J\sigma\, g_J A = 0$, hence
$o(A) = o(\sigma A) = o(B)$.

Conversely, let $o(A) = o(B) = J$. With respect to a basis $E_i$, $1 \le i \le n$,
let $A = \sum E_i a_i$. J is generated by the $a_i$. Let $u_\alpha$, $1 \le \alpha \le p$, be a
minimal set of generators of J. Then $u_\alpha = \sum r_{\alpha j} a_j$
and $a_j = \sum b_{j\beta} u_\beta$ and $u_\alpha = \sum r_{\alpha j} b_{j\beta} u_\beta$, that is, $\sum (r_{\alpha j} b_{j\beta} - \delta_{\alpha\beta}) u_\beta = 0$. Since
the $u_\beta$ form a minimal system of generators, none of the
elements $\sum r_{\alpha j} b_{j\beta} - \delta_{\alpha\beta}$ can belong to L*; in other words, $\sum r_{\alpha j} b_{j\beta} - \delta_{\alpha\beta} \equiv 0$
mod I.

We put $F_\alpha = \sum E_j b_{j\alpha}$ and claim that the $F_\alpha$ are linearly independent
mod I. Indeed, from $\sum F_\beta c_\beta = \sum E_j b_{j\beta} c_\beta \equiv 0$ mod I it follows that $\sum b_{j\beta} c_\beta \equiv 0$
mod I and hence $\sum r_{\alpha j} b_{j\beta} c_\beta \equiv c_\alpha \equiv 0$ mod I. The $F_\alpha$ can be completed to a basis $F_i$.
For let $E_i$, $1 \le i \le n$, be a basis of V and $F_\alpha = \sum E_i f_{i\alpha}$.
The (n, p)-matrix $(f_{i\alpha})$ has rank p mod I; it
can therefore be extended mod I to an (n, n)-matrix of rank n. If
$(f_{ij})$, $1 \le i, j \le n$, is a preimage of such an extension, put
$F_j = \sum E_i f_{ij}$. Obviously $A = \sum F_\alpha u_\alpha$.

Since $o(B) = o(A) = J$, we can also write B in the form $B = \sum G_\alpha u_\alpha$,
where the $G_\alpha$ are linearly independent mod I. The $G_\alpha$ can therefore be
completed to a basis $G_i$. We define $\sigma \in GL(n, L)$ by $\sigma F_i = G_i$.
Then $\sigma A = B$.


2.2. Proof of Satz 1. Let $\tau$ be a transvection, H a hyperplane
with $\tau|H$ = identity. Let $\varphi$ be a linear form with $\varphi^{-1}(0) = H$. Let B be a
vector with $\varphi(B) = 1$. Then $X - B\varphi(X) \in H$ for all $X \in V$ and therefore
$\tau X - \tau B\varphi(X) = X - B\varphi(X)$. We put $A = \tau B - B \in H$. Then


(8) $\tau X = X + A\varphi(X)$.


We say: the vector A in (8) belongs to the transvection $\tau$. A is determined by
$\tau$ (or more precisely: by H) only up to an element $c \in L^*$; for
if $\varphi$ is replaced by $c\varphi$, then A goes over into $Ac^{-1}$. Obviously $o(\tau) = o(A)$
if A belongs to $\tau$.
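The invariance of the order under conjugation can be tried out numerically for the ring L = Z/(p^k) of 1.9, where every ideal has the form (p^j). The following Python sketch is only an illustration under assumptions made here: the ring Z/(27), the 3 x 3 matrices and the helper names `elementary` and `order_exponent` are hypothetical choices, not notation from the paper. It records the exponent j of the ideal generated by the entries of M minus the identity, which for a transvection is the order o(A) of its vector.

```python
# Illustrative sketch: orders of transvections over the assumed ring Z/(3^3).
P, K = 3, 3
MOD = P ** K
N = 3

def identity():
    return [[1 if i == j else 0 for j in range(N)] for i in range(N)]

def elementary(i, j, c):
    """Identity matrix with the extra entry c in position (i, j), i != j."""
    m = identity()
    m[i][j] = c % MOD
    return m

def matmul(a, b):
    return [[sum(a[i][t] * b[t][j] for t in range(N)) % MOD for j in range(N)]
            for i in range(N)]

def valuation(x):
    """Largest j <= K such that P^j divides x modulo P^K."""
    x %= MOD
    if x == 0:
        return K
    j = 0
    while x % P == 0:
        x //= P
        j += 1
    return j

def order_exponent(m):
    """Exponent j of the ideal (P^j) generated by the entries of m - identity."""
    return min(valuation(m[i][j] - (1 if i == j else 0))
               for i in range(N) for j in range(N))

# tau X = X + A*phi(X) with A = 3*E_1 and phi the third coordinate form.
tau = elementary(0, 2, 3)
# sigma is a product of elementary matrices, so its inverse is known explicitly.
sigma = matmul(matmul(elementary(0, 1, 1), elementary(1, 2, 2)), elementary(2, 0, 5))
sigma_inv = matmul(matmul(elementary(2, 0, -5), elementary(1, 2, -2)), elementary(0, 1, -1))
conjugate = matmul(matmul(sigma, tau), sigma_inv)
```

Here `order_exponent(tau)` and `order_exponent(conjugate)` agree, as the relation o(A) = o(σA) of the Lemma leads one to expect.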

We now make the assumptions of Satz 1. We represent $\tau_1$ and $\tau_2$
by


(9) $\tau_r X = X + A_r\varphi_r(X)$, r = 1, 2.


Let $B_r$ be chosen so that $\varphi_r(B_r) = 1$. Since $o(\tau_1) = o(\tau_2)$, we have $o(A_1) = o(A_2)$.
By the Lemma there is therefore a $\sigma \in GL(n, L)$ with $\sigma A_1 = A_2$. From the
Lemma for n - 1 it follows that one can at the same time achieve $\sigma H_1 = H_2$, where
$H_r = \varphi_r^{-1}(0)$, and then one can also achieve $\sigma B_1 = B_2$. But then
$\tau_2 = \sigma\tau_1\sigma^{-1}$. The supplement to Satz 1 is proved in 2.4.


2.3. Proof of Theorem 1.


2.3.1. We denote the group defined by (b) by $H_J$ and the
group defined by (c) by $K_J$.


2.3.2. Obviously, for every transvection $\tau$ with $o(\tau) \subseteq J$: $\det\tau = 1$
and $h_J\tau$ = identity, hence $SC(n, L, J) \subseteq H_J$.


Conversely, let $\sigma$ be an element of $H_J$. If J = L, that is, if
SC(n, L, J) = SL(n, L), then, as is well known (cf. Dieudonné l.c.
or Artin l.c.), a matrix representation of $\sigma$ can be brought, by multiplication
from the right and from the left with suitable elements of SL(n, L), into the form
$\operatorname{diag}(1, 1, \dots, a)$, where $a = \det\sigma = 1$.

Now let $\sigma \in H_J$ with $J \subseteq I$. Let $\sigma$ be represented by the matrix $((a_{ik}))$.
Since $h_J\sigma$ = identity, $a_{ik} \in J$ for $i \neq k$ and $a_{ii} - 1 \in J$. By multiplying from the left
and from the right with suitable elements of SC(n, L, J),
we can bring $((a_{ik}))$ into the form $\operatorname{diag}(1 + u_1, \dots, 1 + u_n)$, $u_i \in J$.

The formula


-(“ 


= 


shows that the element $\operatorname{diag}((1+u)^{-1}, 1, \dots, 1, 1+u)$, $u \in J$, also belongs to
SC(n, L, J). Hence, by multiplication with suitable
elements of SC(n, L, J), we can even obtain from $((a_{ik}))$ the matrix $\operatorname{diag}(1, 1, \dots, 1 + w)$,
$w \in J$. Because of $\det((a_{ik})) = \det\operatorname{diag}(1, 1, \dots, 1 + w) = 1 + w = 1$
we have w = 0, hence $\sigma \in SC(n, L, J)$, $H_J \subseteq SC(n, L, J)$.
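One identity of the required kind (not necessarily the one printed in the original) is, for u in J and with x = u/(1 - u), the factorisation diag(1 - u, (1 - u)^{-1}) = E_21(-x) · [E_12(-1) E_21(u) E_12(1)] · E_12(x), where E_12(x) and E_21(y) denote the elementary 2 x 2 matrices with x above, respectively y below, the diagonal. All three factors are transvections of order contained in (u), and every element 1 + w with w in J has the form (1 - u)^{-1}. The Python sketch below checks this identity in the assumed example ring Z/(27); the function names are hypothetical, and `pow(a, -1, m)` requires Python 3.8 or later.

```python
MOD = 27  # assumed example ring L = Z/(27); its maximal ideal is I = (3)

def mul(a, b):
    """Product of two 2x2 matrices with entries reduced modulo MOD."""
    return [[sum(a[i][t] * b[t][j] for t in range(2)) % MOD for j in range(2)]
            for i in range(2)]

def upper(x):
    """Elementary matrix E_12(x) with x above the diagonal."""
    return [[1, x % MOD], [0, 1]]

def lower(y):
    """Elementary matrix E_21(y) with y below the diagonal."""
    return [[1, 0], [y % MOD, 1]]

def diagonal_as_product(u):
    """Return (product, expected) for the factorisation stated in the lead-in."""
    inverse = pow((1 - u) % MOD, -1, MOD)   # 1 - u is a unit because u lies in I
    x = u * inverse % MOD
    middle = mul(mul(upper(-1), lower(u)), upper(1))   # a conjugate of E_21(u)
    product = mul(mul(lower(-x), middle), upper(x))
    expected = [[(1 - u) % MOD, 0], [0, inverse]]
    return product, expected

# u runs through the ideal I = (3) of Z/(27).
checks = [diagonal_as_product(u) for u in range(0, MOD, 3)]
all_agree = all(product == expected for product, expected in checks)
```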


2.3.3. A generating element of $K_J$ has the form $\rho\sigma\rho^{-1}\sigma^{-1}$ with
$\rho \in GL(n, L)$ and $\sigma \in GC(n, L, J)$. Since $h_J\sigma \in$ Center GL(n, L/J),
$h_J(\rho\sigma\rho^{-1}\sigma^{-1})$ = identity. Since, moreover, $\det\rho\sigma\rho^{-1}\sigma^{-1} = 1$, we have
$K_J \subseteq H_J$.


Conversely, let $\tau$ be a transvection in SC(n, L, J). We write
$\tau$ in the form (8) with $A \in \varphi^{-1}(0) = H$. We first consider the case
$n \ge 3$. Then $\dim H \ge 2$. We claim that we can then write A in the
form $A = A_1 - A_2$ with $o(A_1) = o(A_2) = o(A)$ and $A_1 \in H$,
$A_2 \in H$.

Indeed, let $A = \sum E_i a_i$, where $E_i$, $1 \le i \le n-1$, is a basis for H.
If n - 1 = 2m, we put


A,= Zz (2; — + > 
A, = + EB + 425) 


If n - 1 = 2m + 1, we replace in the preceding expressions the
summand j = 1 by


Ey (Gomes + A, — + E,(a, + Qoms1) 
Ey (a, + + (a) + de + omer) + Bemis (Gems +41). 


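We now define $\tau_r$ (r = 1, 2) by (9) with $\varphi_r = \varphi$. By Satz 1 there is a
$\sigma \in GL(n, L)$ with $\tau_2 = \sigma\tau_1\sigma^{-1}$. Hence


$\tau = \tau_1\tau_2^{-1} = \tau_1\sigma\tau_1^{-1}\sigma^{-1} \in K_J$, $SC(n, L, J) \subseteq K_J$.


In the case n = 2 we use the assumption that $L/I \neq F_2$. There is
then a $c \in L^*$ such that $A = A_1 - A_2$, $A_1 = A(1 + c)$, $A_2 = Ac$, $o(A) = o(A_1)
= o(A_2)$. By Satz 1 there is a $\sigma \in GL(2, L)$ such that $\tau = \tau_1\sigma\tau_1^{-1}\sigma^{-1} \in K_J$,
hence $SC(2, L, J) \subseteq K_J$.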


2.4. Proof of Satz 1, supplement. We use the notation
of 2.2. In particular, $\tau_r$ (r = 1, 2) is represented by (9). Let
$H_r = \varphi_r^{-1}(0)$. By assumption, $o(\tau_r) = o(A_r) = (a)$ is a principal ideal.
That is, we can write $A_r$ in the form $A_r = E_r a$, $o(E_r) = L$. Since
$n \ge 3$, hence $\dim H_r = n - 1 \ge 2$, there is in $H_r$ a vector $F_r$ that is linearly
independent mod I of $E_r$, r = 1, 2. We complete $E_1$, $F_1$ to a basis
of $H_1$ and complete, for an arbitrary $c \in L^*$, $E_2$, $F_2 c$ to a basis of $H_2$.
From this we see that there always is a $\sigma \in GL(n, L)$ such that $\sigma E_1 = E_2$ (hence
$\sigma A_1 = A_2$), $\sigma F_1 = F_2 c$ and $\sigma H_1 = H_2$, $\sigma B_1 = B_2$, hence $\tau_2 = \sigma\tau_1\sigma^{-1}$. If we
choose $c \in L^*$ suitably, we can achieve $\det\sigma = 1$, that is,
$\sigma \in SL(n, L)$.


2.5. Proof of Theorem 2. We consider the homomorphism
$f: \sigma \in GC(n, L, J) \to (\det\sigma, h_J\sigma) \in L^* \times \text{Center } GL(n, L/J)$.


The group on the right is isomorphic to $L^* \times (L/J)^*$. By Theorem 1,
Kern(f) = SC(n, L, J).
Let $\sigma \in GC(n, L, J)$ and $\det\sigma = a \in L^*$. Then


$g_J a = g_J\det\sigma = \det h_J\sigma = \det\operatorname{diag}(b, b, \dots, b) = b^n$, $b \in (L/J)^*$.


For a given $b \in (L/J)^*$ choose a $b' \in L^*$ with $g_J b' = b$. Then
the matrix $((b'\delta_{ik}))$ represents an element $\sigma$ of GC(n, L, J) with $g_J\det\sigma = b^n$.


2.6. Proof of Satz 2.


2.6.1. We first consider the case $n \ge 3$ and assume that the
set $(\tau_\alpha)_{\alpha \in A}$ consists of a single element $\tau$. Let $\tau$ be represented by
(8) with $A = \sum E_i a_i$, $1 \le i \le n-1$, where the vectors $E_i$, $1 \le i \le n-1$,
form a basis for the hyperplane $H = \varphi^{-1}(0)$. $J = o(\tau) = o(A)$ is
generated by the elements $a_i$, $1 \le i \le n-1$.


Wir definieren A, und A, durch 


n-1 n-1 n-1 

Wir haben A,—A—A, und 0(A) ~0(A,). Wir erkliren die Transvek- 
tionen tr, (r=1,2) durch (9). Nach Satz 1 gibt-es o€ GL(n,L) mit 
= Da man speziell so wahlen kann, dass = E,— E;,, (i << n—1), 
oE',_, = Ey, erkennt man, dass es sogar in SL(n, L) ein o gibt mit 7; oro", 
d.h. r,€ G@= die von + erzeugte unter SL(n, L) invarianten Untergruppe von 
GL(n,L). Dann auch r2—77,1€ G, und indem wir auf 7, einen ent- 
sprechenden Schluss anwerden erkennen wir, dass G eine Transvektion enthalt 
mit zugehérigem Vektor 

Auf Grund der Ergiinzung zu Satz 1 kénnen wir'sagen, dass G alle 
Transvektionen enthalt mit zugehdrigem Vektor Fa,c(—(Ec)a,), wo 
=L und ce L*. Offenbar ergibt sich genau so, dass G auch die Transvek- 




tionen mit zugehérigem Vektor 1 [iS n—1,0(L) =L,cé€ L*, enthalt. 
Wenn c€ J, soc—1¢€L*. Da Hajc = Ea;(c—1) + Kaj, enthalt G auch die 
Transvektionen mit Vektor Haj;c,o(#) =L, c€ L, und durch Produktbildung 
folgt: G enthilt alle Transvektionen mit Vektor Hb, wo b= > ajc; ein 
beliebiges Element aus dem Ideal J —o0(r) ist. 

Finally, let $\tau'$ be an arbitrary transvection of order $o(\tau') = J'
\subseteq o(\tau) = J$. Let
(11) $\tau' X = X + A'\varphi'(X)$


be a representation of $\tau'$. We write $A' = \sum E'_i a'_i$, $1 \le i \le n-1$, where
the $E'_i$, $1 \le i \le n-1$, form a basis of $H' = \varphi'^{-1}(0)$. Since $a'_i \in J' \subseteq J$,


the transvections
(12) $\tau'_i X = X + E'_i a'_i \varphi'(X)$
belong to G, hence also $\tau' \in G$.


2.6.2. We now consider the case $n \ge 3$ with an arbitrary set
$(\tau_\alpha)_{\alpha \in A}$. Let J be the ideal generated by the $J_\alpha = o(\tau_\alpha)$. An arbitrary
transvection $\tau'$ with $o(\tau') = J' \subseteq J$ has a representation (11) with
$A' = \sum E'_i a'_i$, $a'_i \in J' \subseteq J$. The considerations of 2.6.1 show
that G contains the transvections $\tau'_i$, $1 \le i \le n-1$, of the form (12), hence also
$\tau' \in G$.

2.6.3. Wir betrachten den Fall » 2 und nehmen zunichst wiederum 
an, dass die Menge (ra),,, nur aus dem Element + besteht. Durch geeignete 
Wahl der Basis lasst such + in der Form 


(13) (5 


darstellen, wo (w) =o(r) ist. Dann reprisentiert auch 


ein Element aus der von r erzeugten, unter SZ(2, LZ) invarianten Gruppe G. 
Mit der Matrix (13) enthalt G also auch die Matrizen 


(15) were 


Since $2 \in L^*$, we have for every $c \in L$ the representation


(16) $c = (c + 1)^2/2^2 - (c - 1)^2/2^2$.
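The representation (16) of an arbitrary element as a difference of two squares only uses that 2 is a unit. As a quick sanity check, the Python sketch below verifies it for every element of the assumed example ring Z/(25); the choice of ring and the function name `difference_of_squares` are illustrative assumptions, and `pow(2, -1, MOD)` requires Python 3.8 or later.

```python
MOD = 5 ** 2               # assumed example ring Z/(25); 2 is a unit since 5 is odd
HALF = pow(2, -1, MOD)     # the inverse of 2 in Z/(25)

def difference_of_squares(c):
    """Return (a, b) with a = (c + 1)/2 and b = (c - 1)/2 computed in Z/(MOD)."""
    a = (c + 1) * HALF % MOD
    b = (c - 1) * HALF % MOD
    return a, b

# (16) asserts a^2 - b^2 = c for every c in the ring.
identity_holds = all(
    (a * a - b * b) % MOD == c
    for c in range(MOD)
    for a, b in [difference_of_squares(c)]
)
```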


Hence, if $c + 1 \in L^*$ and $c - 1 \in L^*$, we obtain from (15) and (16) that
G also contains the matrix


(17) $\begin{pmatrix} 1 & uc \\ 0 & 1 \end{pmatrix}$.


If, on the other hand, $c + 1 \in I$ or $c - 1 \in I$, we first replace c
by c' = c + 1 or by c' = c - 1, respectively. Then $c' + 1 \in L^*$ and
$c' - 1 \in L^*$, and by (15) and (16) G therefore contains a matrix
(17) in which c is replaced by c'. Since (13) also belongs to G, it follows that
(17) belongs to G as well.

Since a 1-dimensional subspace of $V = V_2(L)$ can always be carried into another
1-dimensional subspace of V by an element $\sigma \in SL(2, L)$,
it follows that G contains, together with the transvection $\tau$, (13), of
order $o(\tau) = (u)$, also all transvections $\tau'$ of order $o(\tau') \subseteq o(\tau)
= (u)$.

We now consider an arbitrary set $(\tau_\alpha)_{\alpha \in A}$ of transvections $\tau_\alpha$.
Let J be the ideal generated by the $J_\alpha = o(\tau_\alpha) = (u_\alpha)$. An arbitrary
transvection $\tau'$ of order $o(\tau') = (u') \subseteq J$ can be represented by (13) with
u' in place of u. u' can be written as $u' = \sum u_\alpha c_\alpha$, $c_\alpha = 0$ for almost all $\alpha$.
The subgroup G of GL(2, L) generated by the $\tau_\alpha$ and invariant under SL(2, L)
contains, as we have just seen, all transvections (17) with $u_\alpha c_\alpha$
in place of uc. Hence the product of these transvections also belongs to G,
i.e., $\tau' \in G$.


2.7. Proof of Satz 3 for n = 2.


.1. Angenommen, G enthailt das Element p der Form ) 
0 a+v 


1 —a— 
1 Transvektion 7. Dann gehért auch 


ptp rt: zu G, das heisst, G enthalt eine Transvektion der Ordnung 


(v) =K und daher, nach Satz 2, SC(n, L, K) C G. 
Wir betrachten den Fall, das 0(G@) =JCJ. Ein Element o€ G 
besitzt die Darstellung 


mit t,y,z€J —o(G), bE L*. Es ist o(c) = (x,y,z) von 2, y und z 
erzeugte Ideal in Z. Wenn wir zeigen, dass G mit o, (18), auch die Trans- 


vektionen 


(19) 1 y—z 1 
0 1 ' 0 1 




enthalt, deren Ordnungen offenbar gerade das Ideal o(o) erzeugen, dann haben 
wir auf Grund von Satz 2, dass SC'(2,L,0(c))C G. Da dies fiir jedes o€ G 
gilt, folgt SC (2, L,J) C G. 
Wir beweisen die Existenz der Transvektionen (19) in zwei Schritten. 
2.7.2.1. Da G invariant ist unter SZ(2,L), enthalt G mit o, (18), 
auch das Element 


— bx + by b? + bz —z? (kurz)\y 


Dann gehért auch o”: 


zu G. Wenn wir in (21) setzen: k = (x+ y)/26, so erhalten wir fiir o” € G 
die Darstellung 


+ be—(y? + 27)/2 (y?—2*)(y + 2)/4b —(b + 
b(y—z) b? + bz—(y’ + 2*)/2 


(kurz) ($ 


a 

0 

Dann gehort auch p’=70"-'r0” zu G. Wir finden fiir ’: 


a?— a?v(2 + v) —uv(1 + v)? 
0 


(22) 


Das Element 7€ SZ(2,L) sei erklart durch ( 


(a? — uv)" 


(23) 


— (ure) mit (0) = (2) = 


Aus 2.7.1, angewandt auf das Element p’, (23), folgt dass G die erste der 
Transvektionen (19) enthilt. 


2.7.2.2. Mit dem Element o, (18), enthalt G auch die Elemente 


Indem wir auf die Elemente (24) dieselben Uberlegungen anwenden, die wit 
auf das Element o, (18), in 2.7.2.1 angewandt haben, erkennen wir, dass G 
auch die Transvektionen der Ordnung (y—2z =z) enthalt, also die letzten 
beiden der Transvektionen (19). 




2.7.3. Wir betrachten jetzt den Fall dass 0(G@) =J—JL. Dann enthalt 
G ein Element o der Form 


Mit dem Element r: ( ) aus SL(2, L) gehort dann auch p= 


—b— 


(26) 0 —ad? 


zu der Gruppe G. Um 2.7.1 anwenden zu koénnen, miissen wir d€ L* so 
wahlen, dass a-*d-? —ad?€ L*, d.h. 


a?d*—1€ L*. 


Da wir char(L/I) ~2 und L/I ~F, vorausgesetzt haben, existiert ein d€ L* 
mit (27) stets dann, wenn L/IJF; est, denn dann hat L/I wenigstens 6 
Elemente. Wenn (27) gilt, dann folgt aus 2.7.1 mit p, (26), dass G eine 
Transvektion der Ordnung L enthilt, also nach Satz 2, SL(2,L)C G. 


2.7.3.1. Es bleibt also der Fall Z/J =F; zu betrachten. Falls g;a?~1 
ist, dann gibt es ein d€ L* so, dass (27) gilt. Wir konnen uns also auf den 
Fall g;a = + 1 beschranken. 


Zuniachst kénnen wir dann d€ L* so wihlen, dass g;ad? ist. Neben 
dem Element p, (26), enthalt G auch das Element p’, das aus (26) entsteht, 
wenn man d ersetzt durch a-td-?. Dann gehért auch pp’ zu G. pp’ hat die Form 


1 
2§ 
pp’ ist also eine Transvektion der Ordnung (b). Falls (b) =Z, so haben 
wir mit Satz 2 SL(2,L,L) =SL(2,L)C G. Falls (b) C I, so haben wir mit 
Satz 2 SL(2,L,(b))C G. Dann gehort auch das Element 


0 a\f1l —b\ /0 a 
(‘ 1 )- ‘) 
zu G. 

Wir haben also jetzt ein Element o, (25), mit b=0 und ga=+1 
in G. Damit enthat G auch das Element p, (26), mit d—=1. Es ist also 
0(p) =J’ = (a®?—1) CI. Nach 2.7.2 ist dann jedenfalls SC (2, L,J’) C G. 

Wir betrachten die Falle: (i) J’=(a—1), (ii) J’=(a+1). Im 
Falle (i) wahlen wir fiir das Element o, (29), eine neue Basis so, dass o 


die Darstellung erhilt: 


(27) 
— 




1 1\/0 a\/4 (1—a)/2 
Es ist also hyo: ; er | Aus 2.7.1 folgt, dass SL(2,L/J’) C hyG. 


Im Falle (ii) waihlen wir fiir o, (29), eine Basis wie folgt: 


Ks ist also hyo: 9 a Aus 2.7.1 folgt wiederum SL (2, L/J’) C hyG. 


Da Kern(hy- | srin,z)) = SC(n, L, J’) C G, folgt SL(2,L) C G. 
2.8. Proof of Satz 3 for $n \ge 3$.


2.8.1. Suppose that G contains an element $\rho$ such that $\rho X - X \in H$ for
all $X \in V$, where H is a subspace of codimension 1. Claim: G then
also contains the transvection $\tau$ of the form
(32) $\tau X = X + (\rho U - U)\varphi(X)$


where $\varphi^{-1}(0) = H$ and U is an arbitrary vector in H. Indeed, put
$\tau' X = X + U\varphi(X)$. Then $\rho\tau'\rho^{-1}\tau'^{-1}$ also belongs to G. Because of $\varphi(\rho X)
= \varphi(X)$ and $\varphi(\rho^{-1}U) = \varphi(\rho U) = 0$, the element $\rho\tau'\rho^{-1}\tau'^{-1}$ has the form (32).


2.8.2. Let $\sigma \in G$. The ideal $o(\sigma) \subseteq J$ is generated by elements
$a \in L$, each of which can be described in the following way: there
is a vector E of order o(E) = L and a linear form $\varphi$ of
order L such that $\varphi(E) = 0$ and $\varphi(\sigma E) = a$. If we show that, for
every such a, G contains a transvection of order (a), then
Satz 3 is proved.


For in that case G contains, by Satz 2, the groups
$SC(n, L, o(\sigma))$ for every $\sigma \in G$, and since J = o(G) is generated by the $o(\sigma)$,
$\sigma \in G$, the assertion follows from Satz 2.


2.8.3. Let $\sigma \in GL(n, L)$. Let E be a vector of order L and $\varphi$ a
linear form of order L with $\varphi(E) = 0$, and let $\varphi(\sigma E) = a$. Claim:
there is a $\sigma'$ in the subgroup G of GL(n, L) generated by $\sigma$ and invariant
under SL(n, L), and there is a basis $E'_i$, $1 \le i \le n$, with dual
basis $\varphi'_i$, $1 \le i \le n$, such that $\varphi'_n(\sigma' E'_i) = 0$ for all i with $1 \le i \le n-2$ and such
that $\varphi'_n(\sigma' E'_k) = a$ for some k with $1 \le k \le n-1$.


2.8.3.1. Zum Beweis dieser Behauptung gehen wir aus von einer Basis 




E, und dualer Basis 1 Sin, so, dass H= und ¢= dr. 
Die Matrix von o und die Matrix von o beziiglich dieser Basis sei bezeichnet 


durch 
Insbesondere ist also a= 1. 
Wir definieren die Transvektionen 7;, 7 An—1, durch 
(33) + (Z) (j~n—1). 
Da G invariant ist, gehort auch pj =o 'rj07; zu G. Wir finden 
(34) pj = Ey, + (ofr) (j,k An—1). 
Ferner, fiir 
(35) =o 1 =o 1 (1— jbn1j) — j. 


2.8.3.2. Wir betrachten zuniichst den Fall: Es gibt ein r, 2 rn, 
mit bp, € L*. In diesem Falle erklaren wir eine neue Basis durch: = 


Fiir 1 < n—1 und haben wir wegen (34) 


(36) prt’; — + ot (of;) = E’; ++ Miss i 


und fiir *4—1 haben wir wegen (35) 
(37) pul’, = pro Ly = (1 — — 
Fall (i): rAn—1. Aus (36) und (37) folgt: 
(pri) =0 fiir 1+< n—1; 


Wenn wir also H’,_, und E’, vertauschen und o’ = py setzen, so folgt, mit k =r, 


die Behauptung 2. 8. 3. 
Fall (ii): r=n—1. Aus (36) und (37) folgt 
=0 fiir n—13 (pnl’1) = 
Mit o’ =p», k—=n—1 folgt die Behauptungz 2. 8. 3. 


2.8.3.3. Wir betrachten den in 2.8.3.2 ausgeschlossenen Fall, dass 


bn € I fiir aller, Dann gibt es jedoch ein r, 2S r<n, so, dass 


b,,€ L*, wie man durch Reduktion modJ erkennt. Wir erkliren eine neue 
Basis durch 2’; = fiir iA r, und Aus (34) und (35) folgt 


(38) pil’, = fiir tAN—1, 
(39) pill’, = — — 
Fall (i): rn. Aus (38) und (39) folgt 
(pills) =0 fiir 1< n—1; = 


Wenn wir H’,, und £’, vertauschen und o’ ~p, setzen, so folgt, mit k =r, 
die Behauptung 2.8.3. 


Fall (ii): rAn. Aus (38) und (39) folgt 
=0 fiir 1+<n—1; = 
Mit o’ —p,, k =r folgt die Behauptung 2. 8. 3. 
2.8.4. Wir kommen jetzt zum Beweis der in 2. 8.2 aufgestellten Behaup- 
tung, dass die Gruppe G eine Transvektion der Ordnung (a) enthalt. Nach 


2.8.3 kénnen wir annehmen, dass wir eine Basis 1 [i= n, von V haben 
mit dualer Basis ¢;, 1=i1= 7, und ein Element c€ G so, dass 


(40) ¢n(oh;) =0 fiir 1<n—1; (cH,) =a fiir ein k, Sn—1. 
Wir betrachten die Transvektion 7: 

(41) tX =X + 

Fiir p=oro'r € G finden wir 

(42) + oF (o*X) —oL (X). 


Es ist also pX —X€ H, wo H die von -, aufgespannte Hyperbene 
ist. Daher folgt aus 2.8.1, mit UV = 1 Si n—1: G enthilt die Trans- 
vektionen mit dem Vektor p#,— 1Sisn—1. Wir 
unterscheiden zwei Fille: 


2.8.4.1. Hines der Elemente 1 Sin—1, gehort zu L*. 
Dann enthalt G also eine Transvektion der Ordnung LZ und mit Satz 2 folgt 
SL(n, L) C G, 0(G) =J =L, es folgt also die Behauptung von Satz 3. 


2.8.4.2. Die Elemente gehéren alle zu I. 
Wir bezeichnen mit K das von diesen Elementen erzeugte Ideal. Dann 
ist also K CI. K stimmt iiberein mit dem von den Elementen ¢,(cf;), 
1=iSn—1, erzeugten Ideal; denn hxo*gxH =gxH impliziert hxogxH 
=gxH und umgekehrt. Wegen Satz 2 und dem Schluss von 2.8.4 haben 
wir SC(n, L, K)C G. 




Wir definieren eine Linearform y durch 
(43) W(oE;) =0 fiir n—1; =0; —1. 
Da die gxEi, 1 Sisn—1, durch hxo unter sich transformiert werden und 
K Cl, bilden die of, 1SiSn—1, zusammen mit E,, eine Basis fir V; 
daher ist y durch (43) wohldefiniert. 

Wir betrachten die Transvektion p: 
(44) p& =X + p(X). 
Offenbar ist o(u) C K, alsowé€ G. Dann gilt auch o' =p'c€ G. Wir finden 

Ey, 1<n—1;3 Enda (cE n+). 


Es ist also co H=H, wo H=¢,1(0). Folglich mit de L*. 
Mit der Tranvektion +, (41), erhalten wir fiir p’ =o'’ro’*r?€ G nach 

(42) den Ausdruck 

(45) =X + 


p, (45), ist also eine Transvektion der Ordnung 0(oL,d — D 
= (a). Nach Satz 2 enthilt G also eine Transvektion der Ordnung (a). 
Damit ist die Behauptung in 2.8.2 bewiesen, und also Satz 3 bewiesen. 


2.9. Theorem 3 is a summary of the results proved
above.


UNIVERSITÄT GÖTTINGEN, GERMANY.


REFERENCES.


[1] E. Artin, Geometric Algebra, New York, 1957.

[2] J. Brenner, “The linear homogeneous group,” Annals of Mathematics, vol. 39
(1938), pp. 472-493. “The linear homogeneous group, II,” ibid., vol. 45
(1944), pp. 100-109.

[3] H. Cartan-S. Eilenberg, Homological Algebra, Princeton, 1956.

[4] J. Dieudonné, Sur les groupes classiques, Paris, 1948.

[5] J. Dieudonné, La géométrie des groupes classiques, Berlin, 1955.

[6] W. Klingenberg, “Linear groups over local rings,” Bulletin of the American
Mathematical Society, vol. 66 (1960), pp. 294-296.

[7] W. Klingenberg, “Lineare Gruppen über verallgemeinerten Bewertungsringen,”
Abhandlungen aus dem Mathematischen Seminar der Universität Hamburg (to
appear).


ON DIFFERENTIAL EQUATIONS AND THE FUNCTION $J_\mu^2 + Y_\mu^2$.* $^1$


By PHILIP HARTMAN.


Introduction. Let $J_\mu = J_\mu(t)$, $Y_\mu = Y_\mu(t)$ denote Bessel functions of
a non-negative order $\mu$ of the first and second kind, respectively, and let t
be real. It is known that $t(J_\mu^2 + Y_\mu^2)$ is increasing or decreasing for t > 0
according as $\mu < 1/2$ or $\mu > 1/2$ and that $(t^2 - \mu^2)^{1/2}(J_\mu^2 + Y_\mu^2)$ is increasing for
$t \ge \mu \ge 0$; cf. [9], p. 446. These facts are usually derived from Nicholson’s
integral formula for $J_\mu^2 + Y_\mu^2$, the proof of which is rather involved. It
seems, therefore, of interest to obtain these assertions directly from simple,
general theorems on differential equations.
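For half-integer orders the two monotony statements can be checked by hand, since the Bessel functions are then elementary. The Python sketch below does this numerically for the order 3/2, using the classical closed forms of $J_{3/2}$ and $Y_{3/2}$; the order, the grid and the function name `sum_of_squares_three_halves` are illustrative choices made here, and a finite grid is of course only a plausibility check, not a proof.

```python
import math

def sum_of_squares_three_halves(t):
    """J_{3/2}(t)^2 + Y_{3/2}(t)^2, computed from the elementary closed forms."""
    amplitude = math.sqrt(2.0 / (math.pi * t))
    j = amplitude * (math.sin(t) / t - math.cos(t))     # J_{3/2}(t)
    y = -amplitude * (math.cos(t) / t + math.sin(t))    # Y_{3/2}(t)
    return j * j + y * y

MU = 1.5
grid = [MU + 0.1 * k for k in range(1, 186)]   # t from 1.6 up to 20.0

# t (J^2 + Y^2) should decrease for mu > 1/2 ...
first = [t * sum_of_squares_three_halves(t) for t in grid]
# ... while (t^2 - mu^2)^(1/2) (J^2 + Y^2) should increase for t >= mu.
second = [math.sqrt(t * t - MU * MU) * sum_of_squares_three_halves(t) for t in grid]

first_is_decreasing = all(a > b for a, b in zip(first, first[1:]))
second_is_increasing = all(a < b for a, b in zip(second, second[1:]))
```

In this case the first function is simply $(2/\pi)(1 + t^{-2})$, which makes the decrease evident.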

Let q = q(t) be a continuous, positive function for large t. General
conditions (cf. [11]) are known which imply that


(0.1) $u'' + q(t)u = 0$


has a pair of real-valued solutions u = x(t), y(t) satisfying, as $t \to \infty$,
asymptotic formulae similar to


(0.2) $q^{1/4} z = \exp i\int^t q^{1/2}(r)\,dr + o(1)$,


$(q^{1/4} z)' = i q^{1/2} \exp i\int^t q^{1/2}(r)\,dr + o(1)$,


where z = x + iy. But these results give no information on the possible
monotone character of $q^{1/4}|z|$ or on sharp bounds for $q^{1/4}|z|$ for all t-values
under consideration. The object of Part I of this paper is to supply such
information. It will be shown there that, under suitable conditions on q,
the number 1 is a bound for $q^{1/2}|z|^2$ and that this implies the monotony of |z|.
The question of the monotony of $q^{1/2}|z|^2$ can be decided in many cases by
examining the applicability of this result to the differential equation having
$q^{1/4}x$, $q^{1/4}y$ as solutions; cf. the change of variables (2.7) below.

These results will be applied in Section 4 to the Bessel functions. It 


* Received November 15, 1960. 

1 This research was supported by the United States Air Force through the Air Force 
Office of Scientific Research of the Air Research and Development Command, under con- 
tract No. AF 18(603)-41. Reproduction in whole or in part is permitted for any purpose 
of the United States Government. 




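will be seen that they imply, in particular, that $z = t^{1/2}(J_\mu + iY_\mu)$ satisfies,
for t > 0,
(0.3) $|z| > 0$, $|z|' \le 0$, $|z|'' \ge 0$


or
(0.4) $|z| > 0$, $|z|' \ge 0$, $|z|'' \le 0$,


according as $\mu > 1/2$ or $\mu \le 1/2$. Note that the last part of (0.3), $|z|'' \ge 0$, is
not implied by the following consequence

(0.5) $(-1)^n\, d^n|z|^2/dt^n \ge 0$ for n = 0, 1, ...

of Nicholson’s formula (for $\mu \ge 1/2$).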


On the other hand, the results and methods of Part I do not lead to the
sequence of inequalities in (0.5). The question of higher order monotony
of $|z(t)|^2$, where z = x(t) + iy(t) is a complex-valued solution of a general
equation (0.1), will be examined in Part IV. The results to be obtained
will depend on Parts II and III which deal with differential equations of
arbitrary order. The methods to be employed will be very different from
and do not depend on those in Part I.

It will remain undecided whether or not the results on the higher order
monotony of $|z|^2$ can be sharpened to give such results about |z|.

Applications of the results of Part IV to Bessel functions give the
complete monotony (i.e., (0.5)) for $|z|^2 = t(J_\mu^2 + Y_\mu^2)$ for t > 0 when
$\mu > 1/2$. This proof is longer than the one involving Nicholson’s formula but
has the advantage of applying to a large class of differential equations.

It can be mentioned that Nicholson’s formula implies that, when $\mu < 1/2$,
the derivative of $|z|^2 = t(J_\mu^2 + Y_\mu^2)$ is completely monotone for t > 0. It
will remain undecided how to derive this result from a general theory of
differential equations. A partial result in this direction is given in Section 22.


Part I. Monotony and convexity of |z|.


This part of the paper concerns inequalities of the form (0.3) or (0.4)
for some complex-valued solution u = z(t) of (0.1). The main result is
Theorem 3.1.


1. Bounds for solutions. Let Q = Q(s) be a continuous, monotone
function of s for s > S satisfying, as $s \to \infty$,


(1.1) $Q(s) \to 1$.


Then
(1.2) $d^2U/ds^2 + Q(s)U = 0$


has a pair of real-valued solutions U = X(s), Y(s) such that Z = X + iY
satisfies, as $s \to \infty$,


(1.3) $Z = \exp i\int^s Q^{1/2}(r)\,dr + o(1)$, $Z_s = iZ + o(1)$,


where $Z_s = dZ/ds$; [12], Appendix. The solution U = Z(s) of (1.2)
satisfying (1.3) is uniquely determined, up to constant factors, by the
requirement that

(1. 4) as 


[6]. In view of (1.3), the (constant) Wronskian of X and Y has the
value 1,
(1.5) $XY_s - X_sY = 1$.


LEMMA 1.1. Let Q = Q(s) be positive, continuous and monotone for
$S < s < \infty$ and satisfy (1.1). Let U = Z(s) be the solution of (1.2)
satisfying (1.3) as $s \to \infty$. Then, for s > S,

$(1.6_1)$ $|Z|^2Q \le 1 \le |Z|^2$ and $|Z_s|^2 \le 1 \le |Z_s|^2/Q$
or

$(1.6_2)$ $|Z|^2Q \ge 1 \ge |Z|^2$ and $|Z_s|^2 \ge 1 \ge |Z_s|^2/Q$
according as


$(1.7_1)$ $dQ \ge 0$ or $(1.7_2)$ $dQ \le 0$.


It will be clear from the proof that the interval $S < s < \infty$ can be
replaced by an interval $S < s < S^*$ $(\le \infty)$, if the assumptions (1.1), (1.3)
refer to $s \to S^*$. If $S^* < \infty$, then Q can be defined at $s = S^*$ by $Q(S^*) = 1$
and is then continuous for $S < s \le S^*$. The existence of a solution U = Z(s)
satisfying (1.3) is obvious.

Remark 1. In addition to (1.6), it can be shown that
(1.8) $|Z|^2Q + |Z_s|^2 \le 1 + Q$
holds in both cases $(1.7_1)$ and $(1.7_2)$. It will be clear that strict inequality
holds in (1.6) and (1.8) if $Q(s) \not\equiv 1$ for large s.


Remark 2. The inequality $|Z|^2 \ge 1$ in $(1.6_1)$ holds for s > S if the
conditions Q > 0, $dQ \ge 0$ for s > S are replaced by $Q(s) \le 0$ for $S < s \le S_0$
and Q(s) > 0, $dQ(s) \ge 0$ for $s > S_0$, where $S_0$ is some fixed number. Note
that Q is not required to be monotone on the interval $S < s \le S_0$.


Proof of Lemma 1.1. If (1.2) is divided by Q(s) and differentiated
with respect to s, and the result divided by Q(s), one obtains the equation


$d^2W/dr^2 + Q^{-1}(s)W = 0$, where s = s(r),
and
$W = U'$, $dr = Q(s)\,ds$, and $dW/dr = U''/Q = -U$.


This makes it clear that it is sufficient to consider only the case $(1.6_1)$-$(1.7_1)$.
(This change of variables, which will be used several times in this paper,
is suggested in part by Wintner [13]; cf. [3].)

Let U = U(s) be any real-valued solution of (1.2). Then, by virtue of
(1.2) and $(1.7_1)$,


(1.9) $d(U^2Q + U_s^2) = U^2\,dQ \ge 0$ and $d(U^2 + U_s^2/Q) = -(U_s/Q)^2\,dQ \le 0$.
For arbitrarily fixed real $\varphi$, let
(1.10) $U = X(s)\cos\varphi + Y(s)\sin\varphi$;


so that (1.1) and (1.3) imply $U^2Q + U_s^2 \to 1$ and $U^2 + U_s^2/Q \to 1$ as $s \to \infty$.
Hence, by (1.9), for s > S,


(1.11) $U^2Q + U_s^2 \le 1$ and $U^2 + U_s^2/Q \ge 1$.


The first inequality in (1.11) shows that $U^2Q \le 1$ for s > S and for
every $\varphi$. For a fixed s, choose $\varphi$ in (1.10) so that $U(s) = (X^2(s) + Y^2(s))^{1/2}
= |Z|$. Then, for this value of s, it is seen that $|Z|^2Q = U^2Q \le 1$.

The second inequality in (1.11) implies that $U^2(s) \ge 1$ if $U_s(s) = 0$.
For a fixed s, choose $\varphi$ so that $U_s(s) = 0$. By Cauchy’s inequality,
$U^2(s) \le |Z(s)|^2$. Thus, for this value of s, $|Z(s)|^2 \ge U^2(s) \ge 1$.

This proves the first two inequalities in $(1.6_1)$. The last two are proved
similarly. Hence Lemma 1.1 follows.


On Remark 1. Again, only the case $(1.6_1)$-$(1.7_1)$ will be considered.
In view of (1.10), the expression $U^2Q + U_s^2$ is a quadratic form in $(\cos\varphi,
\sin\varphi)$ for a fixed s. According to (1.11), the eigenvalues $\lambda$ of this form
satisfy $\lambda \le 1$. It is readily verified, if use is made of (1.5), that the
eigenvalues $\lambda$ of this form are the roots of the equation $\lambda^2 - \lambda(|Z|^2Q + |Z_s|^2)
+ Q = 0$. The inequality (1.8) merely expresses the fact that the largest
root of this equation is at most 1.
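The verification alluded to can be written out as follows. The matrix name M is introduced here only for this sketch; everything else uses the quantities of (1.5), (1.10) and (1.11).

```latex
% Sketch: M denotes the matrix of the quadratic form U^2 Q + U_s^2 in (cos phi, sin phi).
\begin{aligned}
U^2Q + U_s^2 &= (\cos\varphi,\ \sin\varphi)\, M\, (\cos\varphi,\ \sin\varphi)^{T},
\qquad
M = \begin{pmatrix} QX^2 + X_s^2 & QXY + X_sY_s \\ QXY + X_sY_s & QY^2 + Y_s^2 \end{pmatrix},\\
\operatorname{tr} M &= Q|Z|^2 + |Z_s|^2,
\qquad
\det M = Q\,(XY_s - X_sY)^2 = Q \quad\text{by (1.5)},\\
0 &\le (1 - \lambda_1)(1 - \lambda_2) = 1 - \operatorname{tr} M + \det M
\quad\Longleftrightarrow\quad |Z|^2Q + |Z_s|^2 \le 1 + Q .
\end{aligned}
```

The last line uses only that both roots lie on the same side of 1, which is why an inequality of this form is available in both cases of (1.7).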


On Remark 2. In view of Lemma 1.1, it follows that $|Z(s)| \ge 1$,
$|Z_s(s)| \le 1$ for $s \ge S_0$. Choose $\varphi$ so that $\cos\varphi = Y_s(S_0)/|Z_s(S_0)|$, $\sin\varphi
= -X_s(S_0)/|Z_s(S_0)|$. Then U(s) in (1.10) satisfies


$U(S_0) = (XY_s - X_sY)/|Z_s| = 1/|Z_s(S_0)| \ge 1$, $U_s(S_0) = 0$.


Initial conditions $U(S_0) > 0$, $U_s(S_0) = 0$ at $s = S_0$ and $Q \le 0$ for $S < s \le S_0$
imply, by a simple convexity argument, that $dU(s) \le 0$ on this interval.
Hence $U(s) \ge U(S_0) \ge 1$ for $S < s \le S_0$. Cauchy’s inequality gives
therefore $|Z(s)| \ge U(s) \ge 1$.


2. Bounds for $q^{1/2}|z|^2$. The desired inequalities for solutions of (0.1)
do not follow directly from Lemma 1.1 but can be obtained from this lemma
after a standard change of variables (Riemann-Liouville).


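LEMMA 2.1. Let q(t) be a positive function of class $C^2$ for $t > T_0$,
with the properties that


(2.1) $Q = 1 + 5q'^2/16q^3 - q''/4q^2 = 1 + q^{-3/4}(q^{-1/4})''$
satisfies $Q(t) \to 1$ as $t \to \infty$ and that either

$(2.2_1)$ $dQ \le 0$ for $t > T_0$,

or there exists a $T_1 \ge T_0$ such that

$(2.2_2)$ $Q \le 0$ for $T_0 < t \le T_1$ and $Q > 0$, $dQ \ge 0$ for $t > T_1$.


Then (0.1) possesses a pair of real-valued solutions u = x(t), y(t) such that
z = x + iy satisfies, as $t \to \infty$,


(2.3) $q^{1/4}z \sim \exp i\int^t q^{1/2}(r)Q^{1/2}(r)\,dr$,


(2.4) $xy' - x'y = 1$;


and, for $t > T_0$, either
$(2.5_1)$ $q^{1/2}|z|^2 \le 1$ or $(2.5_2)$ $q^{1/2}|z|^2 \ge 1$
according as $(2.2_1)$ or $(2.2_2)$ holds.
If q is of class $C^3$, then Q(t) has a derivative given by
$Q' = (18qq'q'' - 15q'^3 - 4q^2q''')/16q^4$.
Note that $dQ \le 0$ is implied by
(2.6) $q > 0$, $q' \ge 0$, $q'' \le 0$, $q''' \ge 0$.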

Proof. By the Riemann-Liouville change of variables,


(2.7) $U = uq^{1/4}$ and $ds = q^{1/2}(t)\,dt$,


the differential equation (0.1) is transformed into (1.2), where Q is defined
by (2.1) and t = t(s). The interval $T_0 < t < \infty$ is changed into some interval
$(-\infty \le)\ S < s < S^*\ (\le \infty)$.

By Lemma 1.1 and the Remark 2, (1.2) has a pair of real-valued
solutions U = X, Y satisfying (1.3), as $s \to S^*$, and $|Z| \le 1$ or $|Z| \ge 1$ for
$S < s < S^*$ according as $(2.2_1)$ or $(2.2_2)$ holds. If


(2.8) $x = X/q^{1/4}$, $y = Y/q^{1/4}$
(cf. (2.7)) are the corresponding solutions of (0.1), then (2.3) and the
assertion concerning (2.5) follow.
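For the reader's convenience, the computation behind the change of variables (2.7) can be sketched as follows. The abbreviation $p = q^{-1/4}$ is introduced here only for this sketch and is not notation of the paper.

```latex
% Assumption of this sketch: p := q^{-1/4}, so that u = pU and ds/dt = q^{1/2} = p^{-2}.
\begin{aligned}
u'  &= p'U + p\,U_s\,\frac{ds}{dt} = p'U + p^{-1}U_s,\\
u'' &= p''U + p'p^{-2}U_s - p^{-2}p'U_s + p^{-3}U_{ss} = p''U + p^{-3}U_{ss},\\
0 &= u'' + qu = p^{-3}\bigl(U_{ss} + (qp^{4} + p^{3}p'')\,U\bigr)
   = p^{-3}\bigl(U_{ss} + (1 + q^{-3/4}(q^{-1/4})'')\,U\bigr),\\
q^{-3/4}(q^{-1/4})'' &= \frac{5q'^2}{16q^3} - \frac{q''}{4q^2}.
\end{aligned}
```

Thus U satisfies an equation of the form (1.2) whose coefficient agrees with the first expression for Q in (2.1).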

8. Monotony and convexity of |z|. The main theorem of this part 
of the paper can be obtained as a consequence of Lemma 2. 1. 


THEOREM 3.1. Let q(t) satisfy the conditions and x(t), y(t) the asser- 
tions of Lemma 2.1. 


(i) Let Q(t) be in case (2.2,). Then 
(3.1) |2| >0, |2|”20 


fort>T>. If, in addition, q(t) is continuous for t>0 and q(t) S0 for 
0<tT), then (3.1) holds fort>0. If z(t) remains bounded as to 
(that is, if g(t) =const. > 0 for large t), then, for t>0, 


(3.2) |z|>0, |2|”20. 
(ii) Let Q(t) be in case (2.22). Then, fort >To, 
(3. 3) |z| >0, | z|’=0, 


It is readily verified from the last part of (2.1) that if q and Q are monotone and $0 < q(\infty) \le \infty$, then $Q \to 1$ as $t \to \infty$. This gives the following

COROLLARY. If $q(t) \le 0$ for $0 < t \le T_0$ and (2.6) holds for $t > T_0$, then (3.2) holds for t > 0.

Strict inequality holds in the inequalities in (3.1), (3.2), (3.3) unless $Q \equiv 1$ for large t, that is, unless $q(t) = (c_1t + c_2)^{-4}$ for large t, where $c_1, c_2$ are constants.
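The exceptional case $Q \equiv 1$ can be illustrated numerically. The following is a minimal sketch and not part of the paper: it assumes the first expression in (2.1), $Q = 1 + 5q'^2/16q^3 - q''/4q^2$, and evaluates it for $q = (c_1t + c_2)^{-4}$ with hand-computed derivatives. The function names `liouville_Q` and `q_power` are hypothetical.

```python
def liouville_Q(q, dq, d2q):
    """Q = 1 + 5 q'^2 / (16 q^3) - q'' / (4 q^2), the first expression in (2.1)."""
    return 1.0 + 5.0 * dq * dq / (16.0 * q ** 3) - d2q / (4.0 * q * q)


def q_power(t, c1, c2):
    """Return (q, q', q'') for q(t) = (c1 t + c2)^(-4); derivatives computed by hand."""
    a = c1 * t + c2
    return a ** -4, -4.0 * c1 * a ** -5, 20.0 * c1 * c1 * a ** -6


if __name__ == "__main__":
    for t in (0.5, 1.0, 2.0, 5.0):
        # Each value should equal 1 up to rounding.
        print(t, liouville_Q(*q_power(t, 2.0, 1.0)))
    # A coefficient outside the exceptional family, q = 1 + t at t = 1:
    print(liouville_Q(2.0, 1.0, 0.0))
```

For $q = 1 + t$ the same function returns a value strictly greater than 1, so that coefficient does not belong to the exceptional family.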






If the half-line $t > T_0$ is replaced by a finite interval, then the inequalities $|z|' \le 0$ in (3.2) and $|z|' \ge 0$ in (3.3) need not be valid.


Proof of Theorem 3.1. If $u = x(t), y(t)$ are solutions of (0.1) satisfying (2.4), then two differentiations of $r = |z| = (x^2 + y^2)^{1/2}$ show that

(3.4) $r'' = r^{-3}(1 - qr^4)$.

In case (2.2₁), the relation (2.5₁) holds for $t > T_0$. It also holds for $t > 0$ if $q \le 0$ for $0 < t \le T_0$. Hence $r'' \ge 0$ for all t under consideration. This gives (3.1) and, hence, (3.2) if r(t) is bounded as $t \to \infty$.

In the case (2.2₂), the relation (2.5₂) holds and implies $r'' \le 0$ for $t > T_0$. Since r > 0, this gives (3.3).
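The identity obtained by differentiating $r = (x^2 + y^2)^{1/2}$ twice can be checked pointwise. The sketch below is illustrative only and rests on stated assumptions: $x'' = -qx$, $y'' = -qy$ and the normalization $xy' - x'y = 1$. Under these, $rr'' + r'^2 = x'^2 + y'^2 - qr^2$ and $r^2(x'^2 + y'^2) = r^2r'^2 + 1$, so that $r'' = r^{-3} - qr$. The helper names are hypothetical.

```python
import math
import random


def radial_second_derivative(x, y, dx, dy, q):
    """r'' for r = sqrt(x^2 + y^2), assuming x'' = -q x and y'' = -q y."""
    r = math.hypot(x, y)
    dr = (x * dx + y * dy) / r
    # (r r')' = x'^2 + y'^2 + x x'' + y y''  and also  (r r')' = r'^2 + r r''.
    return (dx * dx + dy * dy - q * r * r - dr * dr) / r


def normalized_sample(rng):
    """Random (x, y, x', y') rescaled so that x y' - x' y = 1."""
    while True:
        x, y, dx, dy = (rng.uniform(-2.0, 2.0) for _ in range(4))
        w = x * dy - dx * y
        if w > 0.1:
            s = math.sqrt(w)
            return x / s, y / s, dx / s, dy / s


if __name__ == "__main__":
    rng = random.Random(0)
    x, y, dx, dy = normalized_sample(rng)
    q = 0.7  # arbitrary sample value of the coefficient
    r = math.hypot(x, y)
    print(radial_second_derivative(x, y, dx, dy, q), r ** -3 - q * r)
```

The two printed numbers agree up to rounding; the sign of $1 - qr^4$ therefore decides the convexity or concavity of r.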


4. Applications to Bessel functions. The functions $v = J_\mu(t), Y_\mu(t)$ are real-valued linearly independent solutions of the Bessel equation

(4.1) $v'' + v'/t + (1 - \mu^2/t^2)v = 0$.

Hence $u = t^{1/2}J_\mu(t),\ t^{1/2}Y_\mu(t)$ are real-valued solutions of

(4.2) $u'' + (1 - \alpha/t^2)u = 0$, where $\alpha = \mu^2 - 1/4$.

Note that

(4.3) $\alpha >, =, < 0$ according as $(0 \le)\ \mu >, =, < 1/2$.
Furthermore, 

(4. 4) a(t) + iy(t) — —4¥,(t)), 


where 6 = 4)z, is a solution of (4.2) satisfying 
(4. 5) 2 —iettt o(1) as to. 


If (4.2) is identified with (1.2), Lemma 1.1 and the Remark 2 following 
it give the inequalities 
(4.6) t(Jy?+ ¥,?)(1—a/#?) if p> 4, 
(4.7) + ¥,?)(1—a/#?) > if 3, 


for t > 0. Inequalities of this type were obtained by Schafheitlin [7], p. 86.

If (4.2) is identified with (0.1), so that $q = 1 - \alpha/t^2$, then the corresponding function (2.1) is

(4.8) $Q = 1 + 5\alpha^2/4(t^2 - \alpha)^3 + 3\alpha/2(t^2 - \alpha)^2$

for $t^2 > \max(0, \alpha)$. If $\mu > 1/2$ (i.e., $\alpha > 0$), then Q satisfies (2.2₁) with $T_0 = \alpha^{1/2}$, so that (2.5₁) is valid and becomes

(4.9) $(t^2 - \mu^2 + 1/4)^{1/2}(J_\mu^2 + Y_\mu^2) < 2/\pi$ if $t > (\mu^2 - 1/4)^{1/2}$ and $\mu > 1/2$.
If $\mu < 1/2$ (i.e., $\alpha < 0$), then Q satisfies (2.2₂) with $T_0 = 0$, $T_1 =$ and (2.5₂) gives

(4.10) $(t^2 - \mu^2 + 1/4)^{1/2}(J_\mu^2 + Y_\mu^2) > 2/\pi$ if $t > 0$ and $\mu < 1/2$.

It also follows from Theorem 3.1 that $r = t^{1/2}(J_\mu^2(t) + Y_\mu^2(t))^{1/2}$ satisfies $r > 0$, $r' < 0$, $r'' > 0$ or $r > 0$, $r' > 0$, $r'' < 0$ for t > 0 according as $\mu > 1/2$ or $0 \le \mu < 1/2$.

The inequality (4.10) contrasts sharply with the known inequality

(4.11) $(t^2 - \mu^2)^{1/2}(J_\mu^2 + Y_\mu^2) < 2/\pi$ if $t \ge \mu \ge 0$

usually deduced from Nicholson's formulae, cf. [9], p. 447. If $\mu \ge 1/2$, the last inequality is contained in (4.9); a derivation of (4.11) based on the results of Sections 1-3 will be indicated.
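For a half-integer order the inequalities can be tested with elementary functions. The sketch below is an illustration and not taken from the paper. It assumes the standard closed forms $J_{3/2}(t) = (2/\pi t)^{1/2}(\sin t/t - \cos t)$ and $Y_{3/2}(t) = -(2/\pi t)^{1/2}(\cos t/t + \sin t)$, and it reads (4.9) as $(t^2 - \mu^2 + 1/4)^{1/2}(J_\mu^2 + Y_\mu^2) < 2/\pi$ and (4.11) as $(t^2 - \mu^2)^{1/2}(J_\mu^2 + Y_\mu^2) < 2/\pi$; both readings are assumptions about the garbled displays. All function names are hypothetical.

```python
import math

MU = 1.5
ALPHA = MU * MU - 0.25  # alpha = mu^2 - 1/4 = 2


def bessel_sq_sum(t):
    """J_{3/2}(t)^2 + Y_{3/2}(t)^2 from the elementary closed forms (assumed, see lead-in)."""
    c = math.sqrt(2.0 / (math.pi * t))
    j = c * (math.sin(t) / t - math.cos(t))
    y = -c * (math.cos(t) / t + math.sin(t))
    return j * j + y * y


def lhs_49(t):
    """Left side of (4.9) as read above; requires t^2 > alpha."""
    return math.sqrt(t * t - ALPHA) * bessel_sq_sum(t)


def lhs_411(t):
    """Left side of (4.11) as read above; requires t >= mu."""
    return math.sqrt(t * t - MU * MU) * bessel_sq_sum(t)


def r(t):
    """r = t^(1/2) (J^2 + Y^2)^(1/2), the modulus considered in Theorem 3.1."""
    return math.sqrt(t * bessel_sq_sum(t))


if __name__ == "__main__":
    for t in (1.5, 2.0, 3.0, 5.0, 10.0):
        print(t, lhs_49(t), lhs_411(t), 2.0 / math.pi, r(t))
```

For this order $J^2 + Y^2 = (2/\pi t)(1 + t^{-2})$, so r is visibly positive, decreasing and convex, in agreement with the case $\mu > 1/2$.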
If $t = e^s$, then the Bessel equation (4.1) becomes

(4.12) $v_{ss} + (t^2 - \mu^2)v = 0$, where $t = e^s$.

The change of variables analogous to (2.7),

(4.13) $V = (t^2 - \mu^2)^{1/4}v$, $d\sigma = (t^2 - \mu^2)^{1/2}ds$, where $t > \mu$,

transforms (4.12) into an equation

(4.14) $V_{\sigma\sigma} + RV = 0$,

where

$R = 1 + (e^{4s} + 4\mu^2e^{2s})/4(e^{2s} - \mu^2)^3$ and $s = s(\sigma)$.

The equation has the pair of solutions $V = (t^2 - \mu^2)^{1/4}J_\mu,\ (t^2 - \mu^2)^{1/4}Y_\mu$, where $t = e^s > \mu$ and $s = s(\sigma)$. In terms of s, the coefficient R can be written as

(4.15) $R = 1 + (e^{-2s}/4 + \mu^2e^{-4s})(\sum_{k=0}^{\infty} \mu^{2k}e^{-2ks})^3$, $e^s > \mu$.

It is clear that R is a decreasing function of s (hence $\sigma$), so that (1.6₁) in Lemma 1.1 applied to (4.14) gives (4.11).

It can also be verified from Theorem 3.1 applied to (4.14) that $r = (t^2 - \mu^2)^{1/4}(J_\mu^2 + Y_\mu^2)^{1/2}$ is increasing for $t > \mu$; cf. [9], pp. 446-447.
The inequalities (4.9), (4.10), (4.11) suggest the examination of the function $(t^2 - \mu^2 + \beta)^{1/2}(J_\mu^2 + Y_\mu^2)$, where $\beta \ge 0$ is a constant. Note that if the Riemann-Liouville change of variables (2.7) is altered to

(4.16) $U = (q + \beta)^{1/4}u$, $ds = (q + \beta)^{1/2}dt$,

then (0.1) is transformed into

(4.17) $U_{ss} + [1 + 5q'^2/16(q + \beta)^3 - q''/4(q + \beta)^2 - \beta/(q + \beta)]U = 0$.

Correspondingly, if (4.13) is replaced by

(4.18) $V = (t^2 - \gamma)^{1/4}v$, $d\sigma = (t^2 - \gamma)^{1/2}ds$, where $t^2 = e^{2s} > \gamma$

and $\gamma = \mu^2 - \beta$, then (4.12) is transformed into (4.14), where

(4.19) $R = 1 + [e^{4s}(1 - 4\beta) + 4\gamma e^{2s}(1 + 2\beta) - 4\beta\gamma^2]/4(e^{2s} - \gamma)^3$.
From the relation

(4.20) $dR/ds = -e^{2s}[e^{4s}(1 - 4\beta) + 2\gamma(5 + 4\beta)e^{2s} + 4\gamma^2(1 - \beta)]/2(e^{2s} - \gamma)^4$


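it is clear that there exists a $T = T(\beta, \mu) > 0$ such that according as $\beta > 1/4$ or $\beta < 1/4$, R is increasing or decreasing for $t \ge T$. Thus, by Lemma 1.1, for $\mu \ge 0$,

(4.21) $(t^2 - \mu^2 + \beta)^{1/2}(J_\mu^2 + Y_\mu^2) > 2/\pi$ if $t > T$ and $\beta > 1/4$,
(4.22) $(t^2 - \mu^2 + \beta)^{1/2}(J_\mu^2 + Y_\mu^2) < 2/\pi$ if $t > T$ and $\beta < 1/4$.

These inequalities reduce to (4.9), (4.10) if $\beta = 1/4$.

Obviously, T can be chosen to be 0 [or $\gamma^{1/2} = (\mu^2 - \beta)^{1/2}$] in (4.21) [or (4.22)] if $\beta \ge 1$ and $\mu^2 \le \beta$ (so that $\gamma \le 0$) [or $\beta \le 1/4$ and $\mu^2 \ge \beta$ (so that $\gamma \ge 0$)].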
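The dependence of the monotony of R on the sign of $\beta - 1/4$ can be examined numerically. The sketch below is illustrative and rests on a reconstruction, not on the printed displays: with $p = e^{2s} - \gamma$ and $\gamma = \mu^2 - \beta$, the transformed coefficient is taken to be $(e^{2s} - \mu^2)/p + 5p'^2/16p^3 - p''/4p^2$, which simplifies to $1 + [e^{4s}(1 - 4\beta) + 4\gamma e^{2s}(1 + 2\beta) - 4\beta\gamma^2]/4p^3$. The names `R`, `R_direct` and `dR_ds` are hypothetical.

```python
import math


def R(s, beta, mu):
    """Simplified coefficient, assumed reading of (4.19)."""
    g = mu * mu - beta
    e = math.exp(2.0 * s)
    num = e * e * (1.0 - 4.0 * beta) + 4.0 * g * e * (1.0 + 2.0 * beta) - 4.0 * beta * g * g
    return 1.0 + num / (4.0 * (e - g) ** 3)


def R_direct(s, beta, mu):
    """Same coefficient from q/p + 5 p'^2/(16 p^3) - p''/(4 p^2), p = e^{2s} - gamma."""
    g = mu * mu - beta
    e = math.exp(2.0 * s)
    p = e - g
    return (e - mu * mu) / p + 5.0 * (2.0 * e) ** 2 / (16.0 * p ** 3) - 4.0 * e / (4.0 * p * p)


def dR_ds(s, beta, mu):
    """Derivative of R with respect to s, assumed reading of (4.20)."""
    g = mu * mu - beta
    e = math.exp(2.0 * s)
    bracket = e * e * (1.0 - 4.0 * beta) + 2.0 * g * (5.0 + 4.0 * beta) * e + 4.0 * g * g * (1.0 - beta)
    return -e * bracket / (2.0 * (e - g) ** 4)


if __name__ == "__main__":
    for beta in (0.0, 0.25, 0.5, 1.0):
        print(beta, R(2.0, beta, 1.0), dR_ds(2.0, beta, 1.0))
```

At $s = 2$, $\mu = 1$ the derivative is negative for $\beta = 0$ and positive for $\beta = 1/2$, as the leading term $e^{4s}(1 - 4\beta)$ suggests.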


Part II. An inhomogeneous second order equation. 


5. Preliminaries. The familiar “alternating series argument” shows that if f(t) is continuous for t > 0, $f(t) \ge 0$, $df(t) \le 0$ and $f(t) \to 0$ as $t \to \infty$, then

(5.1) $w(t) = \int_t^\infty f(s)\sin(s - t)\,ds$

is convergent and non-negative for t > 0. In addition, if f(t) is convex, then

(5.2) $w'(t) = -\int_t^\infty f(s)\cos(s - t)\,ds$

is non-positive. Furthermore, the two implications $f \ge 0, f' \le 0 \Rightarrow w \ge 0$ and $f \ge 0, f' \le 0, f'' \ge 0 \Rightarrow w \ge 0, w' \le 0$ are the first two of an infinite sequence of implications. This is clear if it is noted that (5.1) is the unique solution of

(5.3) $w'' + w = f(t)$

satisfying $w \to 0$ as $t \to \infty$.
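A simple instance makes the statement concrete. For $f(s) = e^{-s}$ the integral $\int_t^\infty f(s)\sin(s - t)\,ds$ equals $e^{-t}/2$, which is non-negative, non-increasing, tends to 0 and satisfies $w'' + w = e^{-t}$. The sketch below, which is an illustration and not part of the paper, compares a truncated Simpson approximation with this closed form. The truncation length and step count are arbitrary choices and the function name `w_numeric` is hypothetical.

```python
import math


def w_numeric(f, t, upper=60.0, steps=60000):
    """Composite Simpson approximation of the integral of f(s) sin(s - t)
    over [t, t + upper]; `steps` must be even."""
    h = upper / steps
    total = 0.0
    for i in range(steps + 1):
        s = t + i * h
        if i == 0 or i == steps:
            weight = 1.0
        elif i % 2 == 1:
            weight = 4.0
        else:
            weight = 2.0
        total += weight * f(s) * math.sin(s - t)
    return total * h / 3.0


def f_example(s):
    """Sample forcing term: positive, decreasing, convex, tending to 0."""
    return math.exp(-s)


if __name__ == "__main__":
    for t in (0.5, 1.0, 3.0):
        print(t, w_numeric(f_example, t), 0.5 * math.exp(-t))
```

Here $w = f/2$, so the bound $0 \le w \le f$ (with $q \equiv 1$) holds with room to spare.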
It will be shown below that similar facts are valid for a particular solution of the more general second order equation

(5.4) $w'' + q(t)w = f(t)$

under suitable qualitative conditions on q. (The results will be extended to linear and non-linear differential equations of higher order in Part III.)

Consider the homogeneous equation belonging to (5.4),

(5.5) $v'' + q(s)v = 0$.

It will be assumed below that (5.5) is oscillatory at $s = \infty$, i.e., that every solution $v = v(s)$ of (5.5) has infinitely many zeros clustering at $s = \infty$.

A Green's function G(t,s) for $s \ge t$ for (5.5) will be needed. This will be defined as follows: Let t be fixed. For $s \ge t$, let $v(s) = G(t,s)$ be the solution of (5.5) determined by the initial condition,

(5.6) $G(t,s) = 0$ and $G_s(t,s) = 1$ if $s = t$.

The analogues of (5.3) and (5.1) are (5.4) and

(5.7) $w(t) = \int_t^\infty G(t,s)f(s)\,ds$.

Formally, the derivative of (5.7) is

(5.9) $w'(t) = \int_t^\infty G_t(t,s)f(s)\,ds$,

since G(s,s) = 0.

The results of Part II deal with the “order of monotony” of the particular solution (5.7) of (5.4).

Many of the arguments will depend on the fact that if $v = v(s)$ is a solution of (5.5) and q is positive and monotone, then (5.5) implies that

(5.10) $d(v^2 + v'^2/q) = v'^2\,d(1/q)$,

(5.11) $d(qv^2 + v'^2) = v^2\,dq$.

Hence, $v^2 + v'^2/q$ and $qv^2 + v'^2$ are monotone. In particular, the sequence of maxima of |v| and |v'| are monotone (since these maxima occur when v' = 0 and v = 0, respectively).
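The monotony of the two quadratic forms can be observed numerically. The sketch below is an illustration under stated assumptions: it integrates $v'' + q(s)v = 0$ with the sample coefficient $q(s) = 1 + s$ (an arbitrary increasing choice) by the classical fourth-order Runge-Kutta method and records $v^2 + v'^2/q$ and $qv^2 + v'^2$ along the solution. For increasing q the first should be non-increasing and the second non-decreasing, up to discretization error. All names are hypothetical.

```python
def rk4_path(q, s0, v0, dv0, s1, steps):
    """Classical RK4 for v'' + q(s) v = 0; returns a list of (s, v, v')."""
    h = (s1 - s0) / steps
    s, v, dv = s0, v0, dv0
    path = [(s, v, dv)]
    for _ in range(steps):
        k1v, k1d = dv, -q(s) * v
        k2v, k2d = dv + 0.5 * h * k1d, -q(s + 0.5 * h) * (v + 0.5 * h * k1v)
        k3v, k3d = dv + 0.5 * h * k2d, -q(s + 0.5 * h) * (v + 0.5 * h * k2v)
        k4v, k4d = dv + h * k3d, -q(s + h) * (v + h * k3v)
        v += h * (k1v + 2.0 * k2v + 2.0 * k3v + k4v) / 6.0
        dv += h * (k1d + 2.0 * k2d + 2.0 * k3d + k4d) / 6.0
        s += h
        path.append((s, v, dv))
    return path


def energies(path, q):
    """Values of v^2 + v'^2/q and q v^2 + v'^2 along the path."""
    e1 = [v * v + dv * dv / q(s) for s, v, dv in path]
    e2 = [q(s) * v * v + dv * dv for s, v, dv in path]
    return e1, e2


def q_example(s):
    """Sample coefficient: positive and increasing."""
    return 1.0 + s


# Initial data v = 0, v' = 1, matching the normalization of G(t, s) at s = t.
PATH = rk4_path(q_example, 0.0, 0.0, 1.0, 20.0, 20000)
E1, E2 = energies(PATH, q_example)

if __name__ == "__main__":
    print(E1[0], E1[-1])  # first form decreases from its initial value 1
    print(E2[0], E2[-1])  # second form increases from its initial value 1
```

With these initial data the first form starts at $1/q(0)$, which is the source of the bound $|G(t,s)|^2 \le 1/q(t)$ used later.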


6. Order of monotony of (5.7). In order to state the results succinctly, it will be convenient to introduce the following terminology:

Definition 6.1. A function f(t) will be said to be of class $M_n(a,b)$ or monotone of order n on $a < t < b$ if it has n ($\ge 0$) continuous derivatives $f, f', \dots, f^{(n)}$ satisfying

(6.1_j) $(-1)^jf^{(j)}(t) \ge 0$

for $j = 0, \dots, n$ and $a < t < b$. $M_n$ will be an abbreviation for $M_n(0, \infty)$.

In all of the theorems below where it is assumed that n > 0 and that 
certain functions f,q’,q;,- - - are of class M,(a,b), the assumption of the 
existence and continuity of the n-th derivative and the inequality corresponding 
to (6.1,) can be weakened to the assumption that the (n—1)-st derivative 
multiplied by (—1)" is non-increasing. In fact, if n >1, the conditions 
on the (n—1)-st and n-th derivatives can be replaced by the conditions 
that the (n—2)-nd derivative multiplied by (—1)" is non-decreasing and 


is convex. 


(—1) f(t) 20 


Definition 6.2. f(t) will be said to be of class $M_{nm}(T_0, \infty)$ if $f \in M_n(T_0, \infty)$ and f has m derivatives for $t > T_0$, satisfying

(6.2_j) $f^{(j)}(t) \to 0$ as $t \to \infty$

for $j = 0, \dots, m$. (Note that $f \in M_{n0}(T_0, \infty)$ implies that $f \in M_{n,n-1}(T_0, \infty)$ if $n \ge 1$. Thus the essential cases of $M_{nm}(T_0, \infty)$ are m = 0 and $m \ge n$.)

In analogy with the above, $M_{nm} = M_{nm}(0, \infty)$.
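The sign pattern required of a function monotone of order n, namely $(-1)^jf^{(j)}(t) \ge 0$ for $j = 0, \dots, n$, can be tested on a grid when the derivatives are known in closed form. The sketch below is an illustration only; a sampled check is evidence and not a proof. It uses $f(t) = 1/(1 + t)$, for which $f^{(j)}(t) = (-1)^jj!/(1 + t)^{j+1}$, and $f(t) = t$ as a function that fails already at order 1. All names are hypothetical.

```python
import math


def reciprocal_derivative(j, t):
    """j-th derivative of f(t) = 1/(1 + t), from the closed form."""
    return (-1) ** j * math.factorial(j) / (1.0 + t) ** (j + 1)


def identity_derivative(j, t):
    """j-th derivative of f(t) = t."""
    if j == 0:
        return t
    return 1.0 if j == 1 else 0.0


def is_monotone_of_order(derivative, n, grid):
    """Check (-1)^j f^(j)(t) >= 0 for j = 0..n at every grid point."""
    return all((-1) ** j * derivative(j, t) >= 0.0
               for j in range(n + 1) for t in grid)


if __name__ == "__main__":
    grid = [0.1 * k for k in range(1, 200)]
    print(is_monotone_of_order(reciprocal_derivative, 6, grid))
    print(is_monotone_of_order(identity_derivative, 0, grid))
    print(is_monotone_of_order(identity_derivative, 1, grid))
```

Since every derivative of $1/(1 + t)$ also tends to 0 as $t \to \infty$, this function additionally satisfies the limit conditions (6.2_j) for every m.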


THEOREM 6.1_n. Let $n \ge 0$. Let q(t) have a derivative q'(t) of class $M_n$ and let $0 < q(\infty) \le \infty$. Let f(t) be of class $M_{n+1,0}$. Then (5.4) has a unique solution (given by (5.7)) of class $M_{n,n+2}$.

Remark 1. Under the assumptions on q, the only solution $v = v(s)$ of the homogeneous equation (5.5) satisfying $v'(s) \to 0$ as $s \to \infty$ is $v \equiv 0$. (In fact, by (5.11), the successive maxima of |v'(s)| are non-decreasing.) Hence, (5.4) has at most one solution satisfying $w' \to 0$ as $t \to \infty$. Also, if $0 < q(\infty) < \infty$, then (5.4) has at most one solution satisfying $w \to 0$ as $t \to \infty$.

Remark 2. If all solutions of (5.5) tend to 0 as $t \to \infty$ (e.g., if “$q(t) \to \infty$ smoothly as $t \to \infty$,” cf. [2]), then the condition $f(\infty) = 0$ implicit in $f \in M_{n+1,0}$ can be omitted in Theorem 6.1_n except for assertion $w'' \to 0$ when n = 0.

Remark 3. Suppose that there is a $T_0$, $0 \le T_0 < \infty$, such that $q \le 0$ or $q > 0$ according as $0 < t \le T_0$ or $t > T_0$. For n > 0 and $0 < t < T_0$, the conditions on q and f can then be lightened somewhat: for n = 1 and n = 2, it is sufficient to require only $q \le 0$, $f \ge 0$; for n > 2, it is sufficient to require

that q’,f€ Mn2(0,T.). If the assertion w€ Manse is weakened to w€ M,,> 
for n = 1,2, the conditions can also be lightened for ¢ > Ty: for n=—1, it is 
sufficient to require that q is non-decreasing, f=0 is non-increasing with 
f(%) =0 and f/q is convex; for n= 2, it is sufficient to impose the addi- 


tional condition that — (f/q)’ is convex. 


Remark 4. The proof of Theorem 6.1₀ will give the following a priori bounds for w and w':

(6.3) $0 \le w(t) \le \pi f(t)/q(t)$, $|w'(t)| \le \pi f(t)/q^{1/2}(t)$;

cf. (7.6) below. The upper bound for w is improved successively to 2f(t)/q(t) and f(t)/q(t) in the proofs of Theorems 6.1₁ and 6.1₂; cf. (8.4) and (9.2).

Theorem 6.1_n and the theorem of Hausdorff-Bernstein imply the following:

COROLLARY. Let q, f satisfy the conditions of Theorem 6.1_n for n = 1, 2, ...; then the solution (5.7) of (5.4) has a representation as a Laplace-Stieltjes integral

(6.4) $w(t) = \int_0^\infty e^{-st}\,d\sigma(s)$ for t > 0

with some non-decreasing weight function $\sigma = \sigma(s)$.


Theorems 6.1₀, 6.1₁, 6.1₂ and the corresponding Remark 3 will be proved in Sections 7, 8, 9 below. Theorem 6.1_n with n > 2 is more subtle and does not seem to follow from the cases $n \le 2$ by successive differentiation. In fact, its proof will involve a new existence proof for each n for a differential equation of order n−2. Theorem 6.1_n, with n > 2, will be proved in Sections 16-17 in Part III below.

Theorem 6.1_n deals with the case of a non-decreasing q. In order to state analogous theorems for the case of a non-increasing q, it will be convenient to introduce the following definition:


Definition 6.3. Let the classes of functions $DM_n(a,b)$, $DM_n$, $DM_{nm}(T_0, \infty)$, $DM_{nm}$ be defined as the analogues of $M_n(a,b)$, $M_n$, $M_{nm}(T_0, \infty)$, $M_{nm}$ in which the j-th derivative $f^{(j)} = d^jf/dt^j$ in (6.1_j) and/or (6.2_j) is replaced by $D^jf$, where D is the differential operator $q^{-1}(t)\,d/dt$.

THEOREM 6.2_n. Let $n \ge 0$. Let q(t) be continuous, non-increasing and let (5.5) be oscillatory at $t = \infty$ (in particular, q > 0). Let $1/q^2$ have a derivative of class $DM_n$. Let f/q be of class $DM_{n+1,0}$. Then (5.4) has a unique solution $w = w(t)$ (given by (5.7)) of class $M_{0,2}$ and, if n > 0, then −w' is of class $DM_{n-1,n+1}$.

Theorem 6.2₀ will be proved in Section 10. Theorem 6.2_n, with n > 0, will be deduced from Theorem 6.1_{n−1} in Section 11.


7. Proof of Theorem 6.1₀. Note that $q(s) \ge q(t) > 0$ for $s \ge t > T_0$. Thus, if $v = v(s) \not\equiv 0$ is a solution of (5.5), then the graph of $y = |v(s)|$ consists of a sequence of “arches.” The first and second comparison theorems
of Sturm imply that if the k-th arch is over the interval s,s s;,, and 
q(so) Sq(s°) when S% Ss then a reflection of the k-th 
arch across the line s = s;, gives an arch lying under the (k —1)-st arch (i.e., 
| v(s.-+t)| S| v(s,—t)| for OS — sy and — S 8% — 8-1) ef. 
[4], p. 531 and p. 5388. Thus if f€ Myo, the “alternating series argument” 
implies that (5.7) is convergent and non-negative for ¢ > 0. 


The kernel G(t,s), for $s \ge t$, can be written in the form

(7.1) $G(t,s) = v_1(t)v_2(s) - v_1(s)v_2(t)$,

where $v = v_1(s), v_2(s)$ are arbitrary solutions of (5.5) subject to the Wronskian condition

(7.2) $v_1(s)v_2'(s) - v_1'(s)v_2(s) = 1$.

Also, for fixed t,

(7.3) $G_t(t,s) = v_1'(t)v_2(s) - v_1(s)v_2'(t)$

is a linear combination of $v_1(s), v_2(s)$ and is, therefore, a solution of (5.5). Hence, the integral in (5.9) is convergent (for the same reasons as is the integral in (5.7)).

If the upper limit of integration $\infty$ is replaced by a fixed T, it is seen that (5.7) and (5.9) represent a solution of (5.4) and its derivative. Thus the same is true for (5.7), (5.9) if it is verified that the integrals are uniformly convergent on compact t-sets of t > 0.


By the alternating series argument, the remainder $|\int_T^\infty|$ of the integral in (5.7) or (5.9) is majorized by $C(b - a)f(T)$, where $s = a, b$ are successive zeros of $v(s) = G(t,s)$ or $v(s) = G_t(t,s)$, $a \le T \le b$, and $|G|$ or $|G_t|$ is majorized by C for $s \ge t$. The monotony of q implies that $b - a \le \pi/q^{1/2}(t)$. By (5.10), $v(s) = G(t,s)$ is such that $V = v^2 + v'^2/q$ is non-increasing for $s > t$. Hence $v^2(s) \le V(s) \le V(t)$ for $s \ge t > T_0$. Thus

(7.4) $|G(t,s)|^2 \le |G_s(t,t)|^2/q(t) = 1/q(t)$ for $s \ge t > T_0$.

Similarly (cf. (5.8)),

(7.5) $|G_t(t,s)|^2 \le |G_t(t,t)|^2 = 1$ for $s \ge t > T_0$.

Hence C can be chosen to be $1/q^{1/2}(t)$, 1 in the respective cases of (5.7), (5.9). Thus the remainder $|\int_T^\infty|$ is at most $\pi f(T)/q(t)$ or $\pi f(T)/q^{1/2}(t)$ in the cases (5.7) or (5.9). This proves the uniform convergence of (5.7), (5.9) on compact subsets of t > 0.

This argument (with T = t) also shows that if $t > T_0$, then

(7.6) $0 \le w(t) \le \pi f(t)/q(t)$, $|w'(t)| \le \pi f(t)/q^{1/2}(t)$.

Hence $w, w' \to 0$ as $t \to \infty$. Also $w'' = -qw + f$ satisfies $|w''| \le (\pi + 1)f(t) \to 0$ as $t \to \infty$. This proves Theorem 6.1₀.

8. Proof of Theorem 6.1₁. In order to prove $w'(t) \le 0$ for t > 0, consider first only $t > T_0$ and write (5.9) as

(8.1) $w'(t) = \int_t^\infty G_t(t,s)q(s)[f(s)/q(s)]\,ds$.

Since $v(s) = G_t(t,s)$ is a solution of (5.5), it follows that $G_t(t,s)q(s) = -G_{tss}(t,s)$. An integration by parts applied to (8.1) gives

$w'(t) = \int_t^\infty G_{ts}(t,s)(f(s)/q(s))'\,ds$.

The integrated terms vanish for, on the one hand, (7.3) shows that $G_{ts}(t,t) = 0$, and, on the other hand, $G_{ts}(t,s)/q^{1/2}(s)$, $1/q^{1/2}(s) = O(1)$ and $f(s) \to 0$ as $s \to \infty$. (The boundedness of $G_{ts}(t,s)/q^{1/2}(s)$ follows from the non-increasing character of $v^2 + v'^2/q$, where $v(s) = G_t(t,s)$; cf. (5.10).)

Another integration by parts gives

$w'(t) = [(G_t(t,s) + 1)(f(s)/q(s))']_{s=t}^{s=\infty} - \int_t^\infty [G_t(t,s) + 1](f/q)''\,ds$.

The integrated term at s = t vanishes for $G_t(t,t) = -1$ by (7.2), (7.3).


The term at $s = \infty$ is also 0 for $v(s) = G_t(t,s)$ is bounded and $(f/q)' \to 0$ as $s \to \infty$. (The last limit relation follows from the fact that (f/q)' is monotone and integrable over $t \le s < \infty$.) Thus

(8.2) $w'(t) = -\int_t^\infty [G_t(t,s) + 1](f/q)''\,ds$.

The assumptions on f, q imply that $(f/q)'' \ge 0$ for $s \ge T_0$. The factor $G_t(t,s) + 1 \ge 0$ by (7.5). Thus $w'(t) \le 0$ for $t > T_0$.

In order to deal with w'(t) on the interval $0 < t \le T_0$, when $T_0 > 0$, note that if a solution of (5.4) (i.e., $w'' = -qw + f$) satisfies initial conditions $w(T_0) > 0$, $w'(T_0) < 0$, then $q \le 0$ and $f \ge 0$ imply, for reasons of convexity, that $w(t) \ge 0$, $w'(t) \le 0$, $w''(t) \ge 0$ for $0 < t \le T_0$. By continuity, the same holds for initial conditions $w(T_0) \ge 0$, $w'(T_0) \le 0$. Hence $w'(t) \le 0$ for $0 < t \le T_0$.

It remains to show that $w'''(t) \to 0$ as $t \to \infty$. Note that the first factor in the integrand in (8.2) satisfies $|G_t(t,s) + 1| \le 2$ for $s \ge t > T_0$, by (7.5). Hence (8.2) implies that

(8.3) $0 \le -w'(t) \le -2(f(t)/q(t))'$ for $t > T_0$.

Thus, the first inequality in (7.6) can be improved to

(8.4) $0 \le w(t) \le 2f(t)/q(t)$ for $t > T_0$.

By (8.3) and the boundedness of q',

(8.5) $0 \le -qw' \le -2f' + 2fq'/q \to 0$ as $t \to \infty$.

Thus a differentiation of (5.4) shows that

(8.6) $w''' = f' - q'w - qw' \to 0$ as $t \to \infty$.

This proves Theorem 6.1₁.


9. Proof of Theorem 6.1₂. It will first be shown that $w''(t) \ge 0$. If $q \le 0$ for $0 < t \le T_0$, then $w'' = -qw + f \ge 0$ for $0 < t \le T_0$. Thus, it can be supposed that $t > T_0$ and that q(t) > 0 on this t-range. Dividing (5.4) by q and differentiating twice gives

(9.1) $W'' + qW = (f/q)''$, where $W = w''/q$.

Since q satisfies the conditions of Theorem 6.1₂, hence Theorem 6.1₀, and $(f/q)'' \in M_{1,0}(T_0, \infty)$, it follows from Theorem 6.1₀ that (9.1) has a non-negative solution for $t > T_0$ given by

$W(t) = \int_t^\infty G(t,s)(f/q)''\,ds$

satisfying $W, W', W'' \to 0$ as $t \to \infty$. By uniqueness (cf. Remark 1 following the statement of Theorem 6.1_n), this solution is $W = w''/q$, where w is given by (5.7).

(Note that under the conditions of Theorem 6.1₂, the inequality in (8.4) can be improved to

(9.2) $0 \le w(t) \le f(t)/q(t)$ for $t > T_0$.

This is clear from the inequality $w'' = f - qw \ge 0$.)

It remains to show that $w^{(4)} \to 0$ as $t \to \infty$. Let (5.4) be differentiated to give

(9.3) $w''' + qw' = f' - q'w$.

The function $-(f' - q'w)$ is of class $M_{2,0}(T_0, \infty)$. Thus, by Theorem 6.1₁, equation (9.3) considered as a second order, inhomogeneous equation for w' has a unique solution, the negative of which is of class $M_{1,3}(T_0, \infty)$. Since uniqueness refers to the class of solutions satisfying $(w')' \to 0$ as $t \to \infty$, this solution is the derivative of (5.7). Thus $-w' \in M_{1,3}$ and $w \ge 0$, that is, $w \in M_{2,4}$. This proves Theorem 6.1₂.


10. Proof of Theorem 6.2₀. Since $v(s) = G(t,s)$ is a solution of (5.5), $G(t,s) = -G_{ss}(t,s)/q(s)$. Thus, without considering questions of convergence, the integral in (5.7) can be written formally as

(10.1) $w(t) = -\int_t^\infty G_{ss}(t,s)(f(s)/q(s))\,ds$.

An integration by parts gives

(10.2) $w(t) = [(1 - G_s(t,s))(f(s)/q(s))]_{s=t}^{s=\infty} - \int_t^\infty [1 - G_s(t,s)](f/q)'\,ds$.

The integrated term at s = t vanishes since $1 - G_s(t,t) = 0$ by (5.6). Since q is non-increasing, (5.11) shows that $V = qv^2 + v'^2$ is non-increasing if $v(s) = G(t,s)$. In particular, $v'^2(s) \le V(s) \le V(t) = 1$ for $s \ge t$ and so, $|G_s(t,s)| \le 1$. Thus

(10.3) $0 \le 1 - G_s(t,s) \le 2$ for $s \ge t$.

Since $f/q \to 0$ as $s \to \infty$, the integrated term at $s = \infty$ in (10.2) is 0. Since $(f/q)' \le 0$ and is (absolutely) integrable over $t \le s < \infty$, it follows that the integral in (10.1) is convergent and

(10.4) $w(t) = -\int_t^\infty [1 - G_s(t,s)](f(s)/q(s))'\,ds$.

In fact, the integral in (10.1) is uniformly convergent on compact subsets of t > 0.

Since $1 - G_s(t,s) = 0$ if s = t, differentiation of (10.4) gives

(10.5) $w'(t) = \int_t^\infty G_{ts}(t,s)(f(s)/q(s))'\,ds$.

This interchange of differentiation and integration is valid for the monotony of q and (5.11) imply that

(10.6) $|G_{ts}(t,s)| \le q^{1/2}(t) \le$ const. for $s \ge t$.

It is now readily verified that (5.7) represents a solution of (5.4). Also (10.3) and (10.4) show that $w(t) \ge 0$ and

(10.7) $0 \le w(t) \le 2f(t)/q(t)$ for t > 0.

Also, (10.5) and (10.6) give

(10.8) $|w'(t)| \le f(t)/q^{1/2}(t)$ for t > 0.

These facts imply the assertions of Theorem 6.2₀, namely, $w(t) \ge 0$ and $w, w', w'' \to 0$ as $t \to \infty$.


11. Proof of Theorem 6.2_n, n > 0. Dividing (5.4) by q and differentiating gives

(11.1) $(w''/q)' + w' = (f/q)'$.

On introducing the new variables W, $\tau$ defined by

(11.2) $W = -w'$ and $d\tau = q(t)\,dt$,

(11.1) can be written as

(11.3) $D^2W + (1/q)W = -D(f/q)$, where $D = d/d\tau = q^{-1}(t)\,d/dt$.

Note that $(1/q^2)' = 2D(1/q)$. Thus an assumption of Theorem 6.2_n implies that $D(1/q) \in DM_n$. Since n > 0, $D(1/q) \ge 0$ and $D^2(1/q) \le 0$; as a function of $\tau$, 1/q is non-decreasing and concave. Since (11.3) is oscillatory at the upper end of the $\tau$-range, it follows that $\tau(t)$ satisfies $\tau(t) \to \infty$ as $t \to \infty$. Thus, $0 < t < \infty$ is mapped onto some range $\tau_0 < \tau < \infty$.

The assumptions of Theorem 6.2_n imply that, as functions of $\tau$, 1/q and $-D(f/q)$ satisfy the assumptions of Theorem 6.1_{n−1}. Hence, (11.3) has a unique solution $W = W(t)$ of class $DM_{n-1,n+1}$.

Uniqueness refers to the class of solutions satisfying $DW \to 0$ as $t \to \infty$. According to the last section and (11.2), the function w given by (5.7) is such that $W = -w'$ is a solution of (11.3). Also, $DW = -w''/q = w - f/q \to 0$ as $t \to \infty$. Thus, if w(t) is defined by (5.7), then $W = -w' \in DM_{n-1,n+1}$. This proves Theorem 6.2_n.


Part III. Differential equations of higher order. 


12. Statement of results. The object of this part of the paper is to 
obtain analogues of Theorem 6.1, for differential equations of the form 


and for related non-linear equations of order k + 2. 


THEOREM 12.1_n. Let $n \ge 0$. Let q(t) possess a derivative q'(t) of class $M_n$ and $0 < q(\infty) \le \infty$. Let $f(t) \in M_{n+1,0}$. If $k \ge 1$, assume


gj(t) € Many, and 


Then (12.1) has a unique solution w= w(t) of class Mask, nsxs2- 


Remark 1. If, in addition, $g_0(\infty) = 0$ and c is a positive constant, then (12.1) has a unique solution $w = w(t)$ such that $w - c \in M_{n+k,n+k+2}$. This follows by introducing $w - c$ as a new dependent variable in (12.1).


Remark 2. If all solutions of (5.5) tend to 0 as t—>oo, then the condi- 
tions —0 in Theorem 12.1, and the condition = 0 in the last 
remark can be omitted except for assertion w—0O when n—0. 


Remark 3. If there is a $T_0 > 0$ such that $q(t) \le 0$ for $0 < t < T_0$ and q(t) > 0 for $t > T_0$, the conditions on q, f, $g_j$ can be reduced on the interval $0 < t < T_0$ corresponding to Remark 3 following Theorem 6.1_n: for n = 1, 2, it is sufficient to require only $q \le 0$, $f \ge 0$, $g_j \ge 0$; for n > 2, it is sufficient to require that $q', f, g_j \in M_{n-2}(0, T_0)$.

Theorem 12.1, and its Corollary should be contrasted with analogous 
theorems in [5], Appendix, in which it is assumed that f=0 and that 
v’ + q(s)vu=0 is disconjugate (rather than oscillatory) for s>0, but no 
conditions like (12.2)-(12.3) occur. 




Remark 4. It should be noted that neither of the conditions (12.2), (12.3) can be omitted in Theorem 12.1_n for any n (if $k \ge 1$). In order to see this, consider first the differential equation

$w''' + w' = -1/(t + 1)$,

where k = 1, q = 1, $g_0 = 0$ and $f = (t + 1)^{-1}$. All conditions of Theorem 12.1_n hold for every n except (12.2). If $w = w(t)$ is a solution for which $w' \to 0$, then

$w'(t) = -\int_t^\infty (s + 1)^{-1}\sin(s - t)\,ds$,

by the Remark 1 following Theorem 6.1_n. It is easily seen, from an integration by parts, that $w'(t) = -t^{-1} + O(t^{-2})$ as $t \to \infty$. Hence $w(t) \to -\infty$ as $t \to \infty$. Consequently, (12.2) cannot be omitted in Theorem 12.1_n.


The differential equation 
(12. 4) +- w’ + Cw/t log t = — log? ft, 


where C >1 and e >0, satisfies all conditions of Theorem 12.1, for every 
n except (12.3) if > 0 is replaced by {> 2. If this equation has a solution 
w=w(t) of class M,(2,0), then its derivative satisfies 


— w’ = 2e f sin (s — t)ds/s log? s 


(12. 5) 
+C w(s)sin(s—t)ds/s log s. 
t 


Let T be so large that t= T implies the inequality 


cf sin (s— t)ds/s log? s = 1/t log? t 
t 


and the inequality which results if C is replaced by 2¢ and the 1 on the right 
by «. Then, since the last term of (12.5) is non-negative, — w’ = ¢/t log’ t 
fort=T. Hence w~e/tlogt-+ (a non-negative, non-increasing function). 
Thus (12.5) implies — w’= 2e/tlog?¢ and so, w(t) = 2e/tlog¢+ (a non- 
negative, non-increasing function). Continuing this argument, one obtains 
the contradiction w(t) = me/tlogt for {= T and m—1,2,-:-. 

Actually, conditions (12.2), (12.3) can be lightened somewhat. 


THEOREM 12.2,. Let q,90,° *,9x-,f satisfy the conditions of Theorem 
12.1, except that (12.2)-(12.3) is replaced by the following when k=1: 
There exist positive, continuous functions «,(t),- + -,¢.(t) for large t with 


the properties that, as to, 






(12.6) f(s) ds = 0 (en 


jor m=1,:-+,k. Then (12.1) has a unique solution w= w(t) of class 
Satisfying = O(e-;(t)) as to for 


When (12. 2)-(12. 3) hold, then (12. 6)-(12. 7) are satisfied by ¢j(¢) =1/t*4 
for j=1,:--,k. Also, if w=w(t) is any function of class M,, then 
w(t) =O(1/t*)) as to for 7=0,---,4—1. Thus Theorem 12.2, 
is contained in Theorem 12. 3p. 


Remark 5. On the one hand, Theorem 12. 2, becomes false if “0(€m(t))” 
in (12.7%) is replaced by “O(em(t)).” This can be seen from the example 
(12.4) where k=1, g=1, g.=C/tlogt and f= 2e/tlog*t, so that the 
integral in (12.7) is majorized by Ce,(t) if «.(¢) =1/log¢. On the other 
hand, (12. 6)-(12.7) can be weakened to 


(12.9) G(s) ds Sten (2), 


for {= T and m=1,::-,k, where 0<6<1, T is some (fixed) positive 


number, 
k-1 
(12. 10) G(s) = (8) J — 1)!, 


Yo=7, yi =2 and yn»—1 for n=2. It is clear that (12.6)-(12.7) imply 
(12.8)-(12.9) if en(¢) is replaced by (const.)em(¢) for a suitable constant. 
The factor y, = 1 in (12.9)-(12.10) for n= 2 cannot be replaced by a smaller 
constant (even when the constant implicit in the O-term of (12.6) is 
arbitrarily small). For example, (12.4) has a completely monotone solution 
ife >0 and 0=C <1, but it has no non-negative, non-increasing solution 
C>1. 


Remark 6. Under the assumptions of Theorem 12.2,, (12.1) can have 
more than one solution of class Mnyx.nsks2. For Theorem 20.1, below implies 
that the differential equation (18.2) has a non-trivial solution w= w(t) 40, 
as well as the trivial solution w=0, of class Masinss. In (18.2), where 
k=1, f=0 and =o, the function «,(¢) satisfies (12.7) since 
0, t>0. 




Theorem 12.2, has an analogue for non-linear equations of the form 
(12.11) w+) + g(t)w = (—1)*F(t,w,—w’,: (—1)**w&), 
Introduce the abbreviations 
(12.12) f(t) F(t,0,- - -,0) 
and, if F—F(t,a,° 

(12.13) 9j(t) =0F /da; at for j=0,1,-- -,k—1. 


THEOREM 12.3,. Let q(t) be as in Theorem 12.1, and q(t) >0 for 
t>0. Let F(t,%,° be defined for t>0, and have con- 
tinuous partial derivatives satisfying 


(12. 14) (— 1) 0 
for mSn+1 and let 


finally, let there exist positive continuous functions ¢,(t),- - +,ex(t) for 
t > 0 and a constant 6,0 <6 <1, such that (12.12), (12.13), (12.10) satisfy 
(12.8), (12.9) for t>0 and m=—1,---,k. Then (12.10) has a unique 
solution w= w(t) of class Satisfying (—1)iw(t) S — j)! 
for t>0 and j=0,- -,k—1. 


Combining Theorems 12. 1,, 12. 2, or 12.3, with the theorem of Hausdorff- 
Bernstein gives 


COROLLARY. If the conditions of Theorem 12.1_n [or 12.2_n or 12.3_n] are satisfied for n = 0, 1, ..., then (12.1) [or (12.1) or (12.11)] has a unique [or at least one] solution $w = w(t)$ representable in the form


w(t) — ettdo(s) fort>0, where do= 0. 
0 


If, in addition to the conditions of Theorem 12.1_n, $g_0(\infty) = 0$ and c > 0, then (12.1) has a unique solution representable in the form


w(t) estdo(s) fort >0, where do = 0. 


The existence statement of Theorems 12.2_n and 12.3_n, for the cases n = 0, 1, 2, will be proved in Section 13, the uniqueness statement in Section 14. This will be used to prove Theorem 6.1_n for n > 2, first for the case $q(\infty) < \infty$ in Section 15 and then for the case $q(\infty) = \infty$ in Section 16. These results, in turn, will be used to complete the proof of Theorems 12.2_n and 12.3_n, n > 2, in Section 17.


13. Existence in Theorems 12.2_n and 12.3_n, $n \le 2$. In this section, it will be supposed that n = 0, 1, or 2. If k = 0, the assertion to be proved is contained in Theorem 6.1_n. Suppose therefore that $k \ge 1$.

In order to deal with Theorems 12.2_n and 12.3_n at the same time, let (12.1) be written in the form (12.11). For the proof of Theorem 12.3_n, it will be supposed that T is any positive number; for Theorem 12.2_n, it will be supposed that T is so large that q(t) > 0 and (12.8), (12.9) hold for $t \ge T$.

The desired solution will be obtained by successive approximations. Let 
If wm(t) have been defined, put 


(13.1m) fim (t) = F(t, wm (t), — Wm’ (t),° (—1)*¥*wm* (t)). 


If possible, define the k-th derivative of wm. by 


(18. 2m) (t) (—1)¥ 8) ds 


and the lower order derivatives by 


(13. 3m) Wn (¢) ff (s—t)* (8) ds/(k—j—1)! 
t 
for j=0,---,4—1. Thus, formally 


(13. 4m) Wingy + ™ (— 1)*fm. 


It will be shown, by induction on m, that w)=0 and (13. 1)-(13. 4) define 
a sequence of functions wo, satisfying 


(13. 5m) Wm Mask, neks2) In € 
in fact, Awm—= Wm— Wm-1 satisfies 
(13. 6m) AWms1 € 


(13.%m) (—1) S (1— 0) —j—1)! for t= T, 


Note that, by (13.1,) and (12.14), 


(13. 8) Wm€ implies fim € 






If Wm; have been defined and Afm = fim—fm-1, then 


k-1 1 

(13. 9) Afan(t) => (—1i(f 
j=0 

where =0F/0a; and 


(t,7) = Fj(t, + (1—r) vm], 


Thus, by (12.14), 
(13. 11) (13. 5m), (18. 5ms1), (18.6m) imply € 
If (13. 7%),° +, (18.%m) hold, then 


Also, if (—1)4Awm,. = 0, then 
(—1)4 + (1— Wm] S for OS 1. 
Thus, by (13.9), (12.13) and (12.10), 
(13.12) (13.6) and (13.%),° - *, (13. %m) imply (13. 13m), 


where 
(13.13) ynAfima/7 S (1—0)0"G for t= T. 


If Wms, Wm have been defined for some m = 1, then the k-th derivative 
Of Wms: Or, equivalently, of Aw,»,, will be defined by the use of 


(13. 14) + = (—1)*Afm. 


If (13.5m+1), (13.5) and (13.6m_,) hold, so that Afm€ by (13.11), 
then (13.14) considered as a second order equation for Awm,. has, according 
to Theorem 6.1,, a unique solution such that (—1)*Awmi™ € Mans2 and 


(13.15) = (—1)* f ds, 
Also, the remark concerning (6.3) shows that for t= T, 
(13. 16) | Aw mss | S yndfm/Q, | | S 
If, in addition, (13.13,,_,) holds, then the integrals in 


(13.17) 




for j= 0,: -,&—1 are convergent and serve to define a function 
such that d*(Ams1) /dt® = Awmia™, so that € Also, (13. 13 m1) 
implies 

(13. 18m) S (1—0)6""G for m>0, 

(18.19m) | S (4/yn) (1 for <= T, m>0. 


By (12.9) and (13.17), the inequality (13.18,,) implies (13.7%). Sum- 
marizing this paragraph, 


(13.20) (13. 5m-1), (13. 5m), (13. 6n-1); 
(13. 13-1) imply (13. 5ms1), (13. 6m), (13. Ym): 


It follows from (13.8), (13.12) and (13.20) that, in order to prove the 
existence of the sequence wo==0,w,,:-- satisfying (13.5)-(13.7), it is 
sufficient to verify the existence of w, such that w, and Aw; =w,— w=, 
satisfy (13.5,), (13.6,) and (13.7%). 

It is clear from (12. 14)-(12.15) and fp = F(¢,0,- - -,0) that fo€ Masso. 
Hence, by Theorem 6.1,, (13.45) has a unique solution w,™ given by (13. 2) 
and (—1)*w,™ € Man... Also, by (6.3) and the remarks about it, 


(13.21) OS (—1)* wi | Safo/ah 


fort=T. Thus (12.8), where f =f, shows that (13.3,) is meaningful for 

j=0,- - -,4—1 and defines a function w;€ With d*w,/dt* = w,™. 

Also, (13.3₀) and (12.8) imply (13.7₀). This completes the induction.
Consequently, w(t) =lim w,)(t), as m—>o, exist uniformly for 

(hence, for j—k+2,---,k-+n-+2 also) on compact 

subsets of ¢= 7. It follows that the limit function w—w(t) is a solution 

of (12.11) for £=T, is of class My,,(T,0), and satisfies 

(18.22) (—1)4wA(t) for t=T, j=0,- -,k—1. 
Also, by (13.18,,) for m= 1 and (13. 21) 

(13. 23) (—1)*wm (t) S G(t) + yafo(t)/q(t) for m=0. 


Thus, by Lebesgue's theorem on majorized convergence, it follows that one can let m tend to $\infty$ in (13.3). In particular, (13.22) can be sharpened to


(—1)/w(t) = (— (s—t)3*[@(s) + ynfo(s)/q(s) ]ds/(k—j—1)}, 
so that w)—+>0 as t->co for j=0,---,k—1. By (12.4) and (12.15), 
F(t,w(t),- (—1)**w®)(t)) € Maso(T,0). Thus if (12.11) is con- 






sidered as a second order equation for w™, it follows from Theorem 6.1, that 
(—1)*w* € 0), hence w€ ). 

Since 7’ > 0 is arbitrary for Theorem 12.3,, the existence statement in 
that theorem (n = 2) is proved. 

In order to complete the proof of the existence statement in Theorem 
12. 2n, nS 2, it is sufficient to verify the uniform convergence of the approxi- 
mations w,(t) and their derivatives on closed subintervals of 0< tT. 
For the remainder of this section, assume that (12.11) is the equation (12.1) 
and thatO0 << fStST. 

By (13.131) and (13.15), it follows there exists a constant C = C'(t,) 
such that 


T 
(13. 24) | | SOC | Afm(s)|ds-+ 6} for m=1; 
cf. the remarks following (7.5) for the estimate of f, ‘ in (13.15). Repeated 
T 
integrations of (13.24) over the interval (¢,7’) and (13.7) give 
| | 
SO{ (s—#)*| Afm(s)| d5/(j—1)!+ 
t i=0 


for j=1,- - -,k—1 if C is sufficiently large. Thus, by (13.9), 


(13. 25) 


| Afmmes | 


(13. 26) t k-1 k-1 
T j=0 j=0 


In (13.26), C—C(t)) is a sufficiently large constant (independent of m). 
The existence of C depends on the linearity of (12.1), so that Fj? in (13.9) 
is merely the coefficient function g;(¢) in (12.1) and hence has a bound for 
to) StST independent of m. 

An induction on m shows that 


m-1 
(13. 27) | | S CD (T—t)4/j! 
i=0 
form=1. Thus, 
Afme |S C( ( 2 (Ch)* (T’— to)*/j!) <0. 
In view of (13. 24)-(13.25), it follows that w9) —lim w,, exists uniformly 


for 0<tStST if j=0,---,k. Since an inequality similar to (13.24) 
holds for Awm,,%*, the uniform limit relation is valid for j =k +1; hence, 






for j=k+2,---,n+k-+2 also. This completes the existence proof in 
Theorem 12.2,, nS 2. 


14. Uniqueness in Theorems 12.2_n and 12.3_n. The uniqueness assertion for n = 0 implies uniqueness for $n \ge 0$. Thus it will be supposed that n = 0. In view of Theorem 6.1₀, it can be supposed that $k \ge 1$.

Let $w = w(t)$ be the solution of (12.11) just constructed in the last
section. Suppose that $w = W(t)$ is another solution with the stated properties.
If $W(t)$ is compared with the 0-th approximation $w_0 \equiv 0$, it is seen that
the case $m = 0$ of

(14.1) $(-1)^j (W^{(j)} - w_m^{(j)}) \ge 0$ for $j = 0, \dots, k$

holds. An induction shows the validity of (14.1) for $m = 0, 1, \dots$. Thus

(14.2) $(-1)^j (W^{(j)} - w^{(j)}) \ge 0$ for $j = 0, \dots, k$.

Let $T$ be so large that (13.22) holds and that the analogous inequalities
hold for $w = W(t)$. Let

(14.3) $\Delta W_m = W - w_m$
and
(14.4) Afim—= F(t, W,- +, (—1)*?*W*") — F(t, +, (—1)** wn 
Then, the case m = 0 of 


holds. On repeating the arguments leading to (13.7%), it is seen that 
(14.5) holds for $m \ge 0$. Hence, $W - w = \lim \Delta W_m$, as $m \to \infty$, is 0 for $t \ge T$.
Since $W - w$ is a solution of a linear, homogeneous differential equation for
$t > 0$, it follows that $W - w \equiv 0$ for $t > 0$. This proves the uniqueness
assertion.


15. Proof of Theorem 6.1$_n$, $q(\infty) < \infty$. Let $n > 2$ and $0 < q(\infty) < \infty$
in the statement of Theorem 6.1$_n$. Let the differential equation (5.4) be
differentiated $n - 2$ times to give

(15.1) $w^{(n)} + q\,w^{(n-2)} + \sum_{j=0}^{n-3} \binom{n-2}{j}\, q^{(n-2-j)}\, w^{(j)} = f^{(n-2)}$.

If this equation is identified with (12.1), then $k = n - 2$,


93 = (—1) 


= 




and the f on the right-side of (12.1) is (—1)"*f®).. The assumptions of 
Theorem 6.1, imply those of Theorem 12.1, for q and that g;, (—1)"*f(-) 
€ Mo41,0. Clearly, condition (12.2) holds, and (12.3) hold when g(0) <om., 

Since Theorem 12.2., has been proved, it follows that (15.1) has a
unique solution $w = w(t)$ € Manic. Successive integrations of (15.1) show
that $w = w(t)$ satisfies

(15.2) $w'' + qw = f + P_{n-3}(t)$,

where $P_{n-3}(t)$ is a polynomial of degree $n - 3$. But since $q(\infty) < \infty$,
w€ Manso imply w’, qu, f—>0 as to, it follows that P,».(¢) —0. Hence, 
w= w(t) is a solution of (5.4) with the desired properties. The uniqueness 
of this solution follows from the uniqueness of solutions of class Mani. for 
(15.1). 
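
The form of the differentiated equation (15.1) can be checked with the Leibniz rule. The following is only a sketch for orientation, under the assumption (inferred from (15.2)) that (5.4) has the form $w'' + qw = f$; the binomial notation is chosen here and need not match the original typography.

```latex
% Assumption: (5.4) reads  w'' + q w = f.
% Leibniz rule for the (n-2)-nd derivative of the product q w:
\[
  \frac{d^{\,n-2}}{dt^{\,n-2}}\,(q w)
  = \sum_{j=0}^{n-2} \binom{n-2}{j}\, q^{(n-2-j)}\, w^{(j)} .
\]
% The term j = n-2 is q w^{(n-2)}; separating it off and differentiating
% (5.4) n-2 times gives
\[
  w^{(n)} + q\,w^{(n-2)}
  + \sum_{j=0}^{n-3} \binom{n-2}{j}\, q^{(n-2-j)}\, w^{(j)} = f^{(n-2)} .
\]
```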


16. Proof of Theorem 6.1$_n$. It remains to prove the cases of Theorem
6.1$_n$ where $n > 2$ and $q(\infty) = \infty$.

By assumption, g’€ M,. Hence (—1)/*q=0 for 
and g‘(c) 0 for j=1,---:,n. There exists a sequence of functions 
hi, ont >0, such that 0S (—1)"h, (—1)"ho SS (—1)"q”, 
hm = 0 for large ¢t, and (t) =lim h»,(t), as m—>o, uniformly on compact 
subsets of ¢>0. Define gm(t) on ¢>0 by 


(t) (t), Gm (co) =0 for j= and gm(1) =q(1); 
so that, in particular, 
7 


and € Mn, <oo. It is clear that (—1)4g,( = (—1)4q for 
Also 


(16. 1) > as m—> oo 
uniformly on compact subsets of ¢ > 0 for In particular, 
after discarding a finite number of qm, if necessary, it can be supposed that 
> 0. 

By the cases of Theorem 6.1, already proved, it follows that 
(16.2) $w'' + q_m(t)\,w = f$


has a unique solution w= w,»,(t) of class Manse. If Gm(t,s) is the Green’s 
function belonging to (16.2), then 


(16.3) Wm (t) (s)as 




By the proof of Theorem 6.1, in Section 7, qm(t) >0 for t=T implies 


(16.4) | Gm(t 3)f(8)as | Saf (L)/am(t)- 


It is clear from the uniformity of (16.1) for $j = 0$ that $G_m(t,s) \to G(t,s)$,
$m \to \infty$, uniformly on compact subsets of $s \ge t > 0$. Hence (16.3), (16.4)
imply that $w_m(t) \to w(t)$, $m \to \infty$, uniformly on compact subsets of $t > 0$;
hence $w = w(t)$ is the solution (5.7) of (5.4). Similarly, it is shown that
$w_m'(t) \to w'(t)$, $m \to \infty$, uniformly on compact subsets of $t > 0$.

It then follows from the differential equations (5.4), (16.2) and from 
(16.1) that m—o, uniformly on compact subsets of ¢>0 for 
Thus w,€ M, for m=—1,2,--- implies that we My. 
Since w € Mo,2 by Theorem 6.19, w€ Mnn+. Let (5.4) be differentiated n — 2 
times, 


n-3 


If the right side of (16.5) is multiplied by (—1)*"-, it becomes a function 
of class Ms. Thus, considering (16.5) as an inhomogeneous differential 
equation of second order for w‘"-?) with known right side, Theorem 6. 1, implies 
that there is a unique solution w("-?) = w("-?)(¢) such that (—1)*-?w-2) € Mo. 
Uniqueness refers to the class of solutious satisfying (w as 
Thus this solution w"-?) == w(-?)(¢) is the (n—2)-nd derivative of (5.7). 
Hence w€ Manse. 


17. Proof of Theorems 12.2$_n$ and 12.3$_n$, completed. In view of
Sections 13 and 16, it can be supposed that $k \ge 1$ and $n > 2$. Let $w_0 \equiv 0$,
$w_1, \dots$ be the sequence of successive approximations defined in Section 13.
By virtue of Theorem 6.1$_n$, $n > 2$, proved in the last section, and a simple
induction, it is seen that the m-th approximation w,» is of class Maxznszso for 
m==0,1,- - Furthermore, the convergences of the sequences {Wm}, {Wm’}, 
‘+ +, {wm} proved in Section 13 implies the convergence of the sequences 
{Wm**)} +, {Wm Consequently wm € for m=—0,1,- - implies 
that w = lim wy» is of class Since w€Mn,2,9 by Theorems 12.2, or 12. 
it follows that w€ Mask 

Differentiating (12.1) or (12.11) n—2 times and applying an argument 
similar to that at the end of the last section shows that w€ Masrnirse. 




Part IV. Higher order monotony of | |*. 


18. The differential equation of Appell. It was remarked by Appell
([1]; cf. [10], p. 298) that to a linear, homogeneous, second order differential
equation, say $L_2 u = 0$, there corresponds a linear, homogeneous, third order
differential equation $L_3 w = 0$ such that if $u = x(t), y(t)$ are arbitrary
solutions of $L_2 u = 0$, then $w = x(t)y(t)$ is a solution of $L_3 w = 0$. When
$L_2 u = 0$ is of the form

(18.1) $u'' + q(t)u = 0$,

then $L_3 w = 0$ is given by

(18.2) $w''' + 4q(t)w' + 2q'(t)w = 0$.

An application of Theorem 12.1$_n$ (and Remark 1) to (18.2) gives


THEOREM 18.1$_n$. Let $n \ge 0$. Let $q(t)$ possess a derivative $q'(t)$ of class
and $0 < q(\infty) < \infty$. Then (18.1) has a pair of solutions $u = x(t), y(t)$
such that

(18.3) $w = x^2(t) + y^2(t) > 0$

satisfies

(18.4) $w(t) - 1 \in$

(The pair of solutions $(x, y)$ of (18.1) in (18.3) is unique up to their
replacement by $(ax + by, cx + dy)$, where $a, b, c, d$ are constants such that
$a^2 + c^2 = b^2 + d^2 = 1$, $ab + cd = 0$.)
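
The correspondence between (18.1) and (18.2) described in Section 18 can be verified directly. The computation below is a sketch for orientation, assuming only that $q$ is differentiable and that $x$, $y$ are solutions of (18.1); it is not part of the original argument.

```latex
% Assumption: x'' = -q x and y'' = -q y, with q differentiable.
\begin{align*}
w    &= xy, \\
w'   &= x'y + xy', \\
w''  &= x''y + 2x'y' + xy'' = -2q\,w + 2x'y', \\
w''' &= -2q'w - 2q\,w' + 2\,(x''y' + x'y'') \\
     &= -2q'w - 2q\,w' - 2q\,(xy' + x'y) \\
     &= -2q'w - 4q\,w' .
\end{align*}
% Hence w''' + 4q w' + 2q' w = 0, which is (18.2).
% Taking y = x shows that x^2 is a solution as well; by linearity of (18.2),
% so is x^2 + y^2.
```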


A corollary of this assertion and the theorem of Hausdorff-Bernstein is

COROLLARY. Let $q(t)$ possess a derivative $q'(t)$ of class $M_n$ for $n = 1,
2, \dots$, and $0 < q(\infty) < \infty$. Then (18.1) has a pair of solutions
$u = x(t), y(t)$ such that (18.3) has a representation of the form

(18.5) $w(t) = 1 + \int_0^\infty e^{-st}\, d\alpha(s)$ for $t > 0$

with a non-decreasing weight function $\alpha = \alpha(s)$.
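
For orientation, the Hausdorff-Bernstein theorem invoked in this Corollary can be stated as follows. This is the standard formulation, written in notation chosen here ($\phi$ is not a symbol of the paper); in the Corollary it would be applied to $\phi = w - 1$.

```latex
% Standard statement (notation \phi assumed for this sketch):
% a function \phi on t > 0 is completely monotone, i.e.
%   (-1)^n \phi^{(n)}(t) \ge 0   for t > 0 and n = 0, 1, 2, \dots,
% if and only if it is a Laplace-Stieltjes transform
\[
  \phi(t) = \int_0^\infty e^{-st}\, d\alpha(s), \qquad t > 0,
\]
% of a non-decreasing function \alpha for which the integral converges
% for every t > 0.
```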


19. Proof of Theorem 18.1$_n$. Let $u = u_1(t), u_2(t)$ be linearly independent
solutions of (18.1). Then the general solution of (18.2) is a linear
combination of $u_1^2$, $u_1 u_2$, $u_2^2$. It follows that if $w = w(t)$ is any solution of
(18.2), then $w(t)$ can be written either in the form $w = \pm[x^2(t) + y^2(t)]$
or in the form $w = x^2(t) - y^2(t)$, where $u = x(t), y(t)$ are (possibly trivial)
solutions of (18.1).


The conditions of Theorem 18.1$_n$ imply that (18.2) satisfies the conditions
of Theorem 12.1$_n$, where $k = 1$, $q$ is replaced by $4q$, $g_0(t) = 2q'(t)$ and
$f(t) = 0$. Since $q(\infty) < \infty$, the derivative $q'(t)$ is integrable over $1 \le t < \infty$.
Hence the monotony of $q'$ implies $g_0(\infty) = 0$. Thus, Theorem 12.1$_n$ and
the Remark 1 following it imply that (18.2) has a unique solution $w = w(t)$
satisfying (18.4). Because of the oscillatory nature of the solutions of (18.1),
this $w(t)$ must be of the form (18.3), where $u = x, y$ are linearly independent
solutions of (18.1).

The uniqueness of the solution $w = w(t)$ of (18.2) implies the uniqueness
of $x, y$ as specified. For the identity $x^2 + y^2 = (ax + by)^2 + (cx + dy)^2$
and the linear independence of the solutions $w = x^2, xy, y^2$ of (18.2) imply
the given relations between the constants $a, b, c, d$.
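
The last step can be written out explicitly; the expansion below is a routine check added for the reader's convenience.

```latex
\begin{align*}
(ax + by)^2 + (cx + dy)^2
  &= (a^2 + c^2)\,x^2 + 2\,(ab + cd)\,xy + (b^2 + d^2)\,y^2 .
\end{align*}
% Since x^2, xy, y^2 are linearly independent, equality with x^2 + y^2
% forces the coefficients to agree:
%   a^2 + c^2 = 1,   ab + cd = 0,   b^2 + d^2 = 1,
% i.e. the matrix with rows (a, b) and (c, d) is orthogonal.
```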


20. The case $q(\infty) = \infty$. Theorem 18.1$_n$ and its Corollary have
analogues in the case $q(t) \to \infty$ as $t \to \infty$.

THEOREM 20.1$_n$. Let $n \ge 0$. Let $q(t)$ possess a derivative $q'(t)$ of
class Mn,, and $q(\infty) = \infty$. Then (18.1) has a pair of solutions $u = x(t), y(t)$
such that (18.3) satisfies

(20.1) (t) € Masnss:

(The uniqueness assertion of Theorem 18.1$_n$ is valid.)

The analogue of the Corollary of Theorem 18.1$_n$ is

COROLLARY. Let $q(t)$ possess a derivative $q'(t)$ of class $M_n$ for
$n = 1, 2, \dots$ and $q(\infty) = \infty$. Then (18.1) has a pair of solutions
$u = x(t), y(t)$ such that (18.3) has a representation as a Laplace-Stieltjes
integral

(20.2) $w(t) = \int_0^\infty e^{-st}\, d\alpha(s)$ for $t > 0$

with a non-decreasing weight function $\alpha = \alpha(s)$.

21. Proof of Theorem 20.1$_n$. The use of the Riemann-Liouville change
of variables (2.7) in (18.1) leads to the differential equation (1.2), where
$Q$ is given by (2.1). Note that $Q - 1 \to 0$ and


dt 




Thus, by a theorem of Bôcher (cf. [12], p. 261), (1.2) has a pair of solutions
$U = X(s), Y(s)$ which satisfy, in terms of the s-variable, as $s \to \infty$,

$X(s) = \cos s + o(1)$, $dX/ds = -\sin s + o(1)$,
$Y(s) = \sin s + o(1)$, $dY/ds = \cos s + o(1)$.

If $x(t) = q^{-1/4}(t)X(s)$, $y(t) = q^{-1/4}(t)Y(s)$ are the corresponding solutions
of (18.1), then the boundedness of $q'$ and $ds = q^{1/2}\,dt$ imply that, as $t \to \infty$,

(21.1) $x(t) = q^{-1/4}(t)(\cos s + o(1))$, $x'(t) = q^{1/4}(t)(-\sin s + o(1))$,
       $y(t) = q^{-1/4}(t)(\sin s + o(1))$, $y'(t) = q^{1/4}(t)(\cos s + o(1))$,

where $s = s(t) \to \infty$ as $t \to \infty$.


Consider the general solution $w = w(t)$ of (18.2),

$w = ax^2 + bxy + cy^2$,

which can be written in the form

(21.2) $w = a[x^2 + y^2] + [bx + (c - a)y]\,y$.

Thus, by (21.1), the derivative satisfies

(21.3) $w' = a\,o(1) + A[\sin(2s + \theta) + o(1)]$,

where $\theta = \theta(a, b, c)$ is a constant,

(21.4) $A = [b^2 + (c - a)^2]^{1/2} \ge 0$,

and the $o(1)$ terms have a monotone majorant, say, $\epsilon(t) \to 0$ as $t \to \infty$,
independent of $a, b, c$. It follows that if $T$ is large and $w'(t)$ does not change
signs on the interval $\tfrac{1}{2}T \le t \le T$, then

(21.5)     and $A \le |a|\,\epsilon(\tfrac{1}{2}T)$.
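
The passage from (21.1) and (21.2) to (21.3) can be sketched as follows. The normalization of the asymptotic forms used here (the factors $q^{\mp 1/4}$) is an assumption based on the usual Liouville substitution $ds = q^{1/2}\,dt$; the printed form of (21.1) is only partly legible in this copy.

```latex
% Assumed asymptotic forms as t -> infinity, with s = s(t):
%   x  = q^{-1/4}(\cos s + o(1)),   x' = q^{1/4}(-\sin s + o(1)),
%   y  = q^{-1/4}(\sin s + o(1)),   y' = q^{1/4}( \cos s + o(1)).
\begin{align*}
(x^2 + y^2)' &= 2\,(xx' + yy') = o(1), \\
(xy)'        &= x'y + xy' = \cos^2 s - \sin^2 s + o(1) = \cos 2s + o(1), \\
(y^2)'       &= 2\,yy' = \sin 2s + o(1).
\end{align*}
% Hence, for w = a(x^2 + y^2) + b\,xy + (c - a)\,y^2,
\[
  w' = a\,o(1) + b\cos 2s + (c - a)\sin 2s + (|b| + |c - a|)\,o(1)
     = a\,o(1) + A\,[\sin(2s + \theta) + o(1)],
\]
% where A = [b^2 + (c - a)^2]^{1/2} and, when A > 0, \theta is chosen with
% A\sin\theta = b and A\cos\theta = c - a.
```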


Suppose that for every large $T > 0$, there exists a function $q_T = q_T(t)$
on $t > 0$ with the properties that $q_T(t) = q(t)$ for $0 < t \le T$ and that $q_T(t)$
satisfies the assumptions of Theorem 18.1$_n$ (that is, $q$ and $q_T$ satisfy the same
conditions except that $0 < q_T(\infty) < \infty$ while $q(\infty) = \infty$).

By Theorem 18.1$_n$,

(21.6) $v'' + q_T(t)\,v = 0$
has a pair of solutions $v = x_T(t), y_T(t)$ such that $w_T = x_T^2 + y_T^2$ satisfies


wr(t) —1€ yr(t) are solutions of (18.1) 
and so, wy is of the form (21.2) on this interval. In particular, (21.2) is 




of class My.,(0,T) and, hence, (21.5) holds. After (21.2) is multiplied
by a suitable positive constant, it can be supposed that $a = \pm 1$ and that
(21.2) is of class Mai(0, T).

In order to show the dependence on $T$, rewrite (21.2) as

(21.7) $w = w_T(t) = a_T x^2 + b_T xy + c_T y^2$.

Let $T$ tend to $\infty$ through a sequence of values for which $a = \lim a_T = \pm 1$
exists. Then it is clear from (21.4), (21.5) that

(21.8) $\lim_{T \to \infty} w_T(t) = a(x^2 + y^2)$

and that this limit relation can be formally differentiated $n + 3$ times. If
S>0 is fixed, then wr€ My,,(0,8) if T >S. Hence a—1 and the limit 
function w = x? + y? is of class M,,,(0, 8) for every S, i.e., w= a? + y? € May. 
By (21.1), $w$ and $w'$ tend to 0 as $t \to \infty$. Hence, if (18.2) is considered
as an inhomogeneous, second order equation

$(-w')'' + 4q(t)(-w') = 2q'(t)\,w(t)$


for —w’, then the (known) right side is of class My,1,5._ Hence, by Theorem 
6.1,, —w’€ Manse, that is, w€ 

Thus, in order to complete the proof of Theorem 20.1$_n$, it remains to
verify the existence of the function $q_T(t)$ with the stated properties. Let
$T > 0$ be large and fixed. Since it is desired that $q_T(t) = q(t)$ for $0 < t \le T$,
it suffices to define gr(t) on ¢= T so that gr’ € (T,0), (T) = q(T) 
for =0,---,n+2 and <o. 

Note that for 7—1,---,n+1 


or, equivalently, for 7—0,-° -,n, 

(21.10) — (— (6 +7). 
But implies 

(21. 11) (—1) + T) 


The relations (21.106) mean that the reduced moment problem for 
j=0,- 


(21.12) w= fs dy(s), 




where pj= (—1)"9(j!)q@*9(T), has a non-decreasing (absolutely con- 
tinuous) solution y(s) = (—1)"*q¢"(s+T7) for s=0. It is then clear 
from the conditions for the solvability of the Stieltjes' moment problem ([8],
p. 6), that the finite sequence $\mu_0, \dots, \mu_n$ can be extended to an infinite
sequence $\mu_0, \mu_1, \dots$ for which there exists a non-decreasing function y on
$s \ge 0$ satisfying (21.12) for $j = 0, 1, \dots$.
In terms of such a solution y, define for $j = 1, \dots, n + 1$ and $t \ge T$,


(21.138) gr (t) = (— (s—t)™4 dy(s)/(n+1—j)! 


(21. 14) gr(t) +f” as. 


It is clear from (21.10) and the definitions of $\mu_0, \dots, \mu_n$ that this definition
of $q_T(t)$ for $t \ge T$ together with the relation $q_T(t) = q(t)$ for $0 < t \le T$
give a function $q_T(t)$ on $t > 0$ such that $q_T'(t)$ is of class My_, and $q_T(\infty) < \infty$.
Since y may not be continuous, $q_T$ need not have an $(n+1)$-st derivative.
But it is clear that $(-1)^{n+1} q_T^{(n)}(t)$ is non-negative, non-increasing and
convex. For the purposes of the proof of Theorem 20.1$_n$ such a function
$q_T(t)$ will suffice; cf. the remark following the Definition 6.1 of the class
$M_n(a, b)$.


22. The case of non-increasing q. If $q$ is non-increasing, one has the
following analogue of Theorem 20.1$_n$.

THEOREM 22.1$_n$. Let $n \ge 0$. Let $q(t)$ be non-increasing and let (18.1)
be oscillatory at $t = \infty$; in particular, $q(t) > 0$. Let $1/q^2$ have a derivative
of class DMy,,. Then (18.1) has a pair of solutions $u = x(t), y(t)$ such that
(18.3) satisfies

(22.1) $w'(t) \in$ DMa nse
and
(22.2) $w(t) \to 1$ or $w(t) \to \infty$ as $t \to \infty$

according as $q(\infty) > 0$ or $q(\infty) = 0$. (The uniqueness assertion of Theorem
18.1$_n$ is valid.)

For the definition of the class DM$_n$, see Definition 6.3.

Proof of Theorem 22.1$_n$. Divide (18.1) by $q$ and differentiate to obtain

(22.3) $(u''/q)' + u' = 0$.




Introduce the new variables

(22.4) $U = u'$, $d\tau = q(t)\,dt$,

so that (22.3) becomes

(22.5) $D^2 U + (1/q)U = 0$, where $D = d/d\tau = (1/q)\,d/dt$.

As in Section 11, it is seen that $0 < t < \infty$ is mapped onto $(-\infty \le)\ \tau^0 < \tau < \infty$
and that (22.5) satisfies the assumptions of Theorem 20.1$_n$.
Hence, (22.5) has a pair of solutions $U = X, Y$ such that

(22.6) $W = X^2 + Y^2 > 0$
and either
(22.7) $W - 1 \in$     or     $W \in$ DM

according as $q(\infty) > 0$ or $q(\infty) = 0$. Let $u = x, y$ be the solutions of (18.1)
satisfying $X = x'$, $Y = y'$; cf. (22.4). Note that $W = x'^2 + y'^2$ and that,
by (18.1) and (18.3),

$-DW = -(2/q)(x'x'' + y'y'') = 2(xx' + yy') = w'$.

Hence, (22.1) follows from (22.7).
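
The two computations used in this proof can be written out as follows. This is a sketch added for orientation, assuming the substitution $U = u'$, $D = (1/q)\,d/dt$ of (22.4)-(22.5) and that $x$, $y$ solve (18.1).

```latex
% With U = u', equation (22.3) reads (U'/q)' + U = 0, where ' = d/dt.
\begin{align*}
DU    &= \frac{1}{q}\,U', \\
D^2 U &= \frac{1}{q}\,\Bigl(\frac{U'}{q}\Bigr)' = -\frac{1}{q}\,U ,
\end{align*}
% which is (22.5). For W = X^2 + Y^2 with X = x', Y = y',
% and x'' = -q x, y'' = -q y:
\begin{align*}
-DW &= -\frac{2}{q}\,(x'x'' + y'y'') = 2\,(xx' + yy') = (x^2 + y^2)' = w' .
\end{align*}
% Note also that D(1/q) = (1/q)\,(1/q)' = \tfrac{1}{2}\,(1/q^2)', so a condition
% on the \tau-derivative of the coefficient 1/q in (22.5) is a condition on
% the t-derivative of 1/q^2.
```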

The assertion that $w(t) \to 1$ as $t \to \infty$ in the case $q(\infty) > 0$ can be proved
by the argument at the beginning of Section 20 involving the Riemann-Liouville
change of variables.

When $q$ tends monotonously to 0 and (18.1) is oscillatory, (18.1) has
at least one solution which is unbounded as $t \to \infty$; [4], p. 529(i). Hence
$\limsup w(t) = \infty$ as $t \to \infty$. Since $w$ is monotone by the case $n = 0$ of (22.1),
it follows that $w(t) \to \infty$ as $t \to \infty$ in the case $q(\infty) = 0$.


THE JOHNS HOPKINS UNIVERSITY. 


REFERENCES. 


[1] P. Appell, "Sur la transformation des équations différentielles linéaires," Comptes
Rendus (Paris), vol. 91 (1880), pp. 211-214.
[2] P. Hartman, "On oscillators with large frequencies," Bollettino della Unione
Matematica Italiana, vol. (3) 14 (1959), pp. 62-65.
[3] P. Hartman, "On the existence of large or small solutions of linear differential
equations," to appear.
[4] P. Hartman and A. Wintner, "On non-conservative linear oscillators of low frequency,"
American Journal of Mathematics, vol. 70 (1948), pp. 529-539.
[5] P. Hartman and A. Wintner, "Linear differential equations with completely monotone
solutions," ibid., vol. 76 (1954), pp. 199-206.
[6] P. Hartman and A. Wintner, "On a problem of Poincaré concerning Riccati's equation,"
ibid., vol. 77 (1955), pp. 791-804.
[7] P. Schafheitlin, "Die Lage der Nullstellen der Besselschen Funktionen zweiter Art,"
Sitzungsberichte der Berliner Mathematischen Gesellschaft, vol. 5 (1906),
pp. 82-93.
[8] J. S. Shohat and J. D. Tamarkin, The problem of moments, New York (1943).
[9] G. N. Watson, A treatise on the theory of Bessel functions, 2nd ed., Cambridge
(1958).
[10] G. N. Watson and E. T. Whittaker, A course of modern analysis, Cambridge (1940).
[11] A. Wintner, "On the normalization of characteristic differentials in continuous
spectra," The Physical Review, vol. 72 (1947), pp. 516-517.
[12] A. Wintner, "Asymptotic integrations of the adiabatic oscillator," American Journal
of Mathematics, vol. 69 (1947), pp. 251-272.
[13] A. Wintner, "On a principle of reciprocity between high- and low-frequency problems
concerning linear differential equations of second order," Quarterly of
Applied Mathematics, vol. 15 (1957), pp. 314-317.




SYMMETRIC PRODUCTS AND JACOBIANS.* * 


By ARTHUR MATTUCK.


The n-fold symmetric product C(n) of an algebraic curve C is a variety 
closely related to the Jacobian variety J of the curve. The low symmetric 
products appear birationally as a family of subvarieties of J for which there 
is no good analogue on other abelian varieties and which have been used by 
Matsusaka to characterize Jacobians intrinsically among abelian varieties. 
The higher symmetric products are used to construct the Jacobian, either by 
the excisions-and-glue method of Weil, or the more precise projective method 
of Chow. 

Now Chow’s construction of the Jacobian as a quotient variety of C(n) 
“fibered” by the linear systems [3] raises the question of whether C(n), for 
n > 2g —2, is actually an algebraic projective bundle over J. We have shown 
elsewhere [8] that this is so; it is thus natural to ask what the Chern classes 
(to speak somewhat loosely) of this bundle are. One of the objectives of this 
paper is to exhibit these classes as elements of A(J), the rational equivalence
ring of J. Once this is done, one has according to a theorem of Grothendieck 
the structure of A(C(n)) explicitly as an extension of A(J). This gives 
for example in a natural form the structure of the homology rings of high 
symmetric products of the closed orientable topological surfaces, which have 
hitherto only been computed “in principle” by the use of Eilenberg-MacLane 
spaces. 

As a by-product of this determination of Chern classes, we get certain 
intersection relations among the subvarieties of J alluded to above which can 
be thought of as generalizing to lower dimensions the well-known (and obvious) 
“relation” © == ©*; they express W,* in terms of W,. 

The intersection relations on C(n) and J here given, in particular the 
basic formula of Section 7, have other applications. For example, they clarify 
and conceptually simplify the proofs of the intersection formulas for the W; 
given by Weil and Matsusaka [7] which play the crucial role in the charac- 
terization of J by Matsusaka previously alluded to, also the formulas from 


* Received September 26, 1960. 

* This research was supported in part by the United States Air Force through the 
Air Force Office of Scientific Research of the Air Research and Development Command, 
under contract no. AF 18 (603) -90. 




which Weil’s original proof of the Riemann hypothesis was derived. Again, 
they can be used as the basis for a geometric account of the Weierstrass points, 
with generalizations. These will appear separately. 

Part II of this paper is devoted to the Chern classes. Part I is pre- 
liminary, and has connection with theorems of Chow and Andreotti. In it 
we prove that if C is a non-hyperelliptic curve, then any g—1 points of a 
generic canonical divisor are algebraically independent (but as will be seen, 
only just!). In addition to being used in a critical argument of Part II, 
we also use it to squeeze out the dimension and irreducibility of certain sub- 
varieties of C(n) which play an important role in Part II, as well as in the 
other applications alluded to above. 

Our emphasis throughout is on rational equivalence, not anything coarser. 


0. Preliminaries and notation. We will work throughout over a fixed 
algebraically closed ground field k, and “generic” will always mean with 
respect to this field. All our basic varieties will be defined over k, and all 
points and divisors used in constructions unless specifically called generic, 
will be understood to be k-rational. 

We denote by C a fixed projective (complete) non-singular curve over k. 
Then C(n) is its n-fold symmetric product, a non-singular projective variety 
most conveniently defined by the Chow coordinates [3, p. 456]. Its points 
represent the positive divisors of degree n on C. 

Except in Section 5, we shall reserve the word "divisor" exclusively for
non-negative zero-cycles on C, and shall denote them by German letters
$\mathfrak a, \mathfrak b, \dots$. We will use a superscript to indicate their degree, wherever it is
convenient in the argument to be reminded of it: thus $\mathfrak a$ and $\mathfrak a^r$ in the same
context represent the same positive divisor of degree r. The dimension of a
divisor, $\dim \mathfrak a$, is always the geometric (projective) dimension of the complete
linear system $|\mathfrak a|$ to which it belongs: so $\dim \mathfrak a = l(\mathfrak a) - 1$.

To avoid some tedious locutions, we shall often casually identify the
point on C(n) with the divisor of degree n it represents, and thus speak of
"the point $\mathfrak a^n$ on C(n)." Where we wish to be precise, we shall use $p(\mathfrak a)$
for this point. Latin letters $x, y, p, q, \dots$ will be reserved exclusively for
points on C; capital letters for varieties and cycles on them. Superscripts
and subscripts on capital letters in general refer to dimension and codimension,
so that on C(n), $X_i$ and $X^{n-i}$ represent the same cycle. Where indices
must be used to distinguish cycles, they will conform to this convention.

If X is a non-singular projective variety, by A(X) we mean its rational
equivalence ring graded by codimension, so that $A_i(X)$ is the group of cycle
classes of codimension $i$. An important formula we shall use constantly in
the second part is the projection formula: Let $f\colon V \to U$ be a regular map,
where V is projective (more generally, let f be proper). If X is a cycle on U
such that $f^{-1}(X)$ is defined as a cycle, and Y is a cycle on V such that
$Y \cdot f^{-1}(X)$ is defined, then in the sense of maps on cycles,

$f(Y \cdot f^{-1}(X)) = f(Y) \cdot X$,

the right side being automatically defined.


Part I. 


1. The independence property. Suppose fixed a linear system $\mathfrak A$ on
the curve C, of degree n and dimension r (not necessarily complete), which
we may think of as represented by the subvariety $A^r$ on C(n) associated with
it; a generic divisor of $\mathfrak A$ is then by definition one corresponding to a generic
point of A. Recall that a positive divisor $\mathfrak b$ is said to be contained in a divisor
$\mathfrak a$ if $\mathfrak a \ge \mathfrak b$; it is contained in $\mathfrak A$ if it is contained in some $\mathfrak a \in \mathfrak A$. What can
be said about the totality of positive divisors of some fixed degree m contained
in $\mathfrak A$? Their Chow points form a subset A[m] of C(m) about which
we can say a little if we know that $\mathfrak A$ has the

Independence property. A linear system $\mathfrak A$ of dimension r will be said
to have the independence property if any r points occurring in a generic
divisor are independent generic points of C.


This notion has arisen incidentally in Chow’s construction of the Jacobian 
[4], and in a weaker form (linear independence) in Andreotti’s proof of 
Torelli’s theorem [1, p. 813]. The property does not depend on the choice 
of generic divisor. We have then 


THEOREM 1. If the linear system $\mathfrak A$, of degree n and dimension r, has the
independence property, then

(i) A[m] is a purely r-dimensional algebraic set on C(m) if $m \ge r$,
otherwise all of C(m),

(ii) If $m \le r$, or nontrivially if $m \ge n - r$, then A[m] is even irreducible.
This in particular will automatically be true (regardless of m) if $\mathfrak A$
is complete and either of degree $n > 2g - 2$, or the canonical system, always
assuming it has the property.


Proof. The points of A[m] are those representing divisors $y_1 + \dots + y_m$,
where $\sum_{i=1}^{n} y_i \in \mathfrak A$ for suitable $y_i$ ($i > m$).

Let $\mathfrak a = x_1 + \dots + x_n$ be a generic divisor of the system $\mathfrak A$. Then
$\sum x_i \to \sum y_i$ is a specialization, since $A^r$ is irreducible and $\mathfrak a$ is generic; an
extension of the specialization to the $x_i$ takes them in some order onto the $y_j$,
so that $x_{i_1} + \dots + x_{i_m} \to y_1 + \dots + y_m$ is a specialization for some
choice of the $x_i$. In other words, every divisor represented by a point of A[m]
is a specialization of at least one of the $\binom{n}{m}$ divisors $x_{i_1} + \dots + x_{i_m}$ of degree
m contained in $\mathfrak a$, and conversely, by similar reasoning it is clear that any
specialization of one of these is represented by a point of A[m].

Now by the independence property, any one of this finite number of
divisors is either made up entirely of independent generic points (if $m \le r$)
or contains r of them (if $m \ge r$), hence the locus of its specializations is
respectively either m-dimensional (and therefore all of C(m)) or r-dimensional.
The union of these loci is as we have seen A[m], which proves statement
(i) of the theorem.

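To show the irreducibility of A[m] if $m \ge n - r$, it being C(m) if
$m \le r$ and therefore trivially irreducible, what we clearly must show is that
any two of the $\binom{n}{m}$ divisors $x_{i_1} + \dots + x_{i_m}$ are specializations of each other.
Let therefore $\mathfrak a_1$ and $\mathfrak a_2$ be two such, so that

$\mathfrak a = \mathfrak a_1 + \mathfrak b_1 = \mathfrak a_2 + \mathfrak b_2$,

where the $\mathfrak b_i$ are positive divisors of degree $n - m$, which is $\le r$ by hypothesis.
By the independence property the $\mathfrak b_i$ are each made up of $n - m$ independent
generic points, so that there is a specialization $\mathfrak b_1 \to \mathfrak b_2$. Let $\mathfrak a_i'$ be a positive
divisor of degree $r - (n - m)$ in $\mathfrak a_i$; then $\mathfrak b_i + \mathfrak a_i'$ is of degree r, has therefore
only independent generic points, and so the specialization extends to $\mathfrak b_1 + \mathfrak a_1'
\to \mathfrak b_2 + \mathfrak a_2'$. Extend it now to $\mathfrak a_1 + \mathfrak b_1 \to \mathfrak c$. Then $\mathfrak c$ is a divisor containing
$\mathfrak b_2 + \mathfrak a_2'$ and it is also a divisor of $\mathfrak A$, since it is a specialization of the generic
divisor $\mathfrak a$. Since A is of dimension r, and $\mathfrak b_2 + \mathfrak a_2'$ has r generic points, there
can be only one divisor of $\mathfrak A$ containing $\mathfrak b_2 + \mathfrak a_2'$, so that $\mathfrak c = \mathfrak a$. We have
therefore a specialization

$\mathfrak a_1 + \mathfrak b_1 \to \mathfrak a_2 + \mathfrak b_2$

extending $\mathfrak b_1 \to \mathfrak b_2$, so that $\mathfrak a_1 \to \mathfrak a_2$ is a specialization also, as was asserted.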


As to the remaining statement, when will every positive m between 0
and n be either $\le r$ or $\ge n - r$? If $n - r \le r + 1$, that is, if $n \le 2r + 1$.
Now if the system $\mathfrak A$ is complete and of degree $n \ge 2g - 1$, then by the
Riemann-Roch theorem, $r = n - g$, so indeed $n \le 2(n - g) + 1$, while if $\mathfrak A$
is the canonical system, $n = 2g - 2$, $r = g - 1$, and $2g - 2 \le 2(g - 1) + 1$:
we even have room to spare!
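
The arithmetic behind this last remark can be laid out as follows; the numerical values in the illustration are hypothetical and chosen only to make the condition concrete.

```latex
% Complete system of degree n >= 2g - 1: Riemann-Roch gives r = n - g, and
\[
  n \le 2r + 1 = 2(n - g) + 1 \iff 2g - 1 \le n .
\]
% Canonical system: n = 2g - 2, r = g - 1, so 2r + 1 = 2g - 1 > n.
%
% Illustration (hypothetical values): g = 3, canonical system, n = 4, r = 2.
% Then n - r = 2, and every m in {1, 2, 3, 4} satisfies m <= 2 or m >= 2.
```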




2. An independence criterion. The following criterion results from 
analysis of a proof of Chow [4]. 


A linear system $\mathfrak A$ of dimension r has the independence property $\iff$
for some choice of $r - 1$ points $y_1, \dots, y_{r-1}$, the linear system $\mathfrak A - \sum y_i$ has
dimension one and no fixed points.

Here $\mathfrak A - \sum y_i$ denotes the residual system (of degree $n - r + 1$ if $\mathfrak A$
has degree n). Of course for general choice of the $y_i$ the system will always
have dimension one, but it may have fixed points.


Proof. Let $x_1 + \dots + x_n$ be a generic divisor of $\mathfrak A$. Then the independence
property is equivalent with the "generic" condition:

For every choice of $r - 1$ independent generic points from among the $x_i$,
the system $\mathfrak A - (x_{i_1} + \dots + x_{i_{r-1}})$ has no fixed points.

Namely, this system is rational over $k(x_{i_1}, \dots, x_{i_{r-1}})$, where k is a field
of definition for C and $\mathfrak A$ over which the $x_{i_j}$ are independent generic points,
and it has $(x_1 + \dots + x_n) - (x_{i_1} + \dots + x_{i_{r-1}})$ as generic divisor. Thus
a point of C is a fixed point of this system if and only if it is one of the
remaining $x_i$ and is algebraic over $k(x_{i_1}, \dots, x_{i_{r-1}})$. Now if $\mathfrak A$ doesn't have
the independence property, some r of the $x_i$ are not independent, so we can
indeed find such an algebraic $x_{i_r}$, and conversely the existence of such a situation
means that $x_{i_1}, \dots, x_{i_r}$ are not independent generic points, so that $\mathfrak A$
does not have the independence property.

This proves the forward implication of the criterion, since if $\mathfrak A$ has the
property, one can take as the $y_i$ just the points $x_{i_1}, \dots, x_{i_{r-1}}$. The implication
is reversed by showing first that $\mathfrak A - \sum x_{i_j} \to \mathfrak A - \sum y_i$ is a specialization
(viewing say the linear systems as irreducible cycles on $C(n - r + 1)$). From
this the theorem follows, for if the latter system has no fixed points, neither
does the former; for example if you consider a finite set $\{\mathfrak b_i\}$ of divisors without
common point from the second system, then a set of foreimages $\{\mathfrak a_i\}$ for
them in the first system also can have no common point.

So relabel the independent generic points $x_{i_j}$ as $x_1, \dots, x_{r-1}$.
Make a specialization $(x_1, \dots, x_{r-1}) \to (y_1, \dots, y_{r-1})$ and extend it to
$\mathfrak A - \sum x_i \to \mathfrak B$, where $\mathfrak B$ will be a one-dimensional cycle on $C(n - r + 1)$.
We show the support of $\mathfrak B$ coincides with the one dimensional system $\mathfrak A - \sum y_i$
(this is enough for our theorem) by showing its points all represent divisors
of the latter system. Indeed, any such point represents a divisor $y_r + \dots + y_n$
that is, over the preceding specializations, itself a specialization of the generic
divisor $x_r + \dots + x_n$ of the first system. Thus $x_1 + \dots + x_n \to y_1 + \dots + y_n$
is a specialization extending the preceding ones. Now since $x_1 + \dots + x_n \in \mathfrak A$,
so does $y_1 + \dots + y_n$, so that $y_r + \dots + y_n \in \mathfrak A - \sum y_i$, as
was asserted.


3. Applications of the criterion. 


(a). The following is due to Chow [4]. If $n > 2g$, then every complete
system $\mathfrak A$ of degree n has the independence property. Namely, the system has
dimension $r = n - g$; let $x_1 + \dots + x_n$ be a generic divisor, where the first
r points are independent generic, and choose for the $y_i$ the last $r - 1$ points.
Then $\mathfrak A - \sum y_i$ has $x_1 + \dots + x_{g+1}$ as generic divisor. But $n > 2g$ implies that $g + 1 \le r$,
hence the $x_1, \dots, x_{g+1}$ are independent generic and so this system has no fixed
points, as required.


(b). THEOREM. If C is not hyperelliptic, then the canonical system $\mathfrak W$
has the independence property. Take the $y_1, \dots, y_{g-2}$ to be independent
generic points, so that $\dim(\mathfrak W - \sum y_i) = 1$. We have to show this system has
no fixed point p. If it did, then $\dim(\mathfrak W - \sum y_i - p) = 1$, so that by the
Riemann-Roch theorem, $\dim|\sum y_i + p| = 1$. This is impossible however,
because it is known that [1] the special divisors on $C(g - 1)$ (those belonging
to linear systems of positive dimension) lie on a closed set whose dimension
is less than $g - 2$, and which therefore cannot contain the $g - 2$ dimensional
point which represents $y_1 + \dots + y_{g-2} + p$. [Briefly, one considers the
canonical mapping of $C(g - 1)$ onto projective $g - 1$ space defined by the g
symmetric regular $g - 1$ forms on $C(g - 1)$. If C is not hyperelliptic, this
map fails to be defined exactly where these differentials all vanish (in the
hyperelliptic case it is always defined) and by direct calculation this occurs
exactly at those points of $C(g - 1)$ representing special divisors. Since
$C(g - 1)$ is nonsingular, this fundamental locus for the map must be of
dimension $\le g - 3$.]

This result is of course false if C is hyperelliptic. 
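
The Riemann-Roch step in the proof of (b) can be written out as follows. This is a sketch for orientation; the symbols $D$ and $K$ (a canonical divisor) and the function $l$ are notation assumed here, with $l(D) = \dim|D| + 1$ as in Section 0.

```latex
% Let D = y_1 + \dots + y_{g-2} + p, a positive divisor of degree g - 1.
% Riemann-Roch:  l(D) = \deg D - g + 1 + l(K - D).
\begin{align*}
l(D) &= (g - 1) - g + 1 + l(K - D) = l(K - D), \\
\dim |K - D| = 1
  &\implies l(K - D) = 2
   \implies l(D) = 2
   \implies \dim |D| = 1 .
\end{align*}
% So a fixed point p of the residual system would make D a special divisor
% of degree g - 1 depending on g - 2 independent generic points.
```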


(c). We do not need the following application in this paper, but have 
used it elsewhere to construct cross-sections of the projective bundle over the 
Jacobian [8]. 


Proposition. If % is a complete linear system on C of degree n > 2g, 
or if YU is the canonical system and C is not hyperelliptic, then the rational 
map associated with % is biregular, and the image C’ is projectively normal. 


Proof. The rational map we mean is the one turning the divisors of 4 
into hyperplane sections. Andreotti [1] has proved the second case of the 


SYMMETRIC PRODUCTS AND JACOBIANS. 195 


| theorem; we prove the first case similarly (it is the one used in [8]). If p, 
s ff and p2 are two distinct points on C, then by the Riemann-Roch theorem, 
| dim & — p, > dim Y% — p, — p,; thus there is a divisor through p, not passing 
through ps, 2 separates points, and the map is one-one. Biregularity follows 
from the projective normality of the image, and this in turn follows by showing 
the linear system &;, of hypersurface sections of degree k is complete for all k, 


which we now do. 


t Let 2 be the smallest linear system on C containing all divisors of the 
forma, +: a; in Clearly C we show dim A” = nk—g, 
which will prove % and therefore also &, is complete. 

By the independence property, say, a generic divisor a of $\mathfrak{A}$ contains no
repeated points. Let A and B be non-overlapping sets containing respectively
$n-g-1$ and g of these points. Since $n \ge 2g+1$, B has never more than
$n-g-1$ points (this many only when $n = 2g+1$). Since $\mathfrak{A}$ has dimension
$n-g$ and the independence property, we can pass hyperplanes $H_1$ and $H_2$
through A and B respectively, each containing no other points of a. Add to
them hyperplanes $H_3, \ldots, H_{k+1}$ not passing through any points of a. Then
$H_1 + \cdots + H_{k+1}$ is a hypersurface of degree $k+1$ cutting out a divisor of
$\mathfrak{A}^{(k+1)}$ and passing through the $n-1$ points of A + B, but not through all n points
of a. Thus the n points of a must impose independent conditions on $\mathfrak{A}^{(k+1)}$,
which shows that $\dim \mathfrak{A}^{(k)} \le \dim \mathfrak{A}^{(k+1)} - n$; since $\dim \mathfrak{A}^{(1)} = n - g$, the
argument is complete.
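The induction implicit in this argument can be written out as follows. This is only a sketch: here $\mathfrak{A}^{(k)}$ stands for the smallest linear system containing all sums of k divisors of the given system, $\mathfrak{A}_k$ for the system of hypersurface sections of degree k, and the upper bound taken from the Riemann-Roch theorem is our gloss rather than a step spelled out in the text.

```latex
\begin{align*}
\dim \mathfrak{A}^{(k)}
  &\ge \dim \mathfrak{A}^{(1)} + (k-1)\,n
   = (n-g) + (k-1)\,n = nk - g
   && \text{(the inequality above, applied $k-1$ times)} \\
\dim \mathfrak{A}^{(k)}
  &\le \dim \mathfrak{A}_k \le nk - g
   && \text{(Riemann-Roch, since $nk > 2g-2$)}
\end{align*}
% Hence all three dimensions agree, so both systems coincide with the
% complete linear system of degree nk.
```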


In the sequel an important role will be played by a set of varieties $S^{(i)}$,
$i = -1, 0, 1, \ldots$, which we now define.


Definition. $S^{(i)}$ is the set of all points on C(g+i) representing special
divisors, that is, those for which dim > i.


THEOREM 2. $S^{(i)}$ is a variety of dimension $g-1$ for all $i = -1, \ldots, g-2$,
and otherwise empty.


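Proof. Exactly those divisors $a^{g+i}$ are special which are contained in the
canonical system, for by the Riemann-Roch theorem,

$\dim |a| > i \iff \dim |\mathfrak{f} - a| \ge 0 \iff \mathfrak{f}' \ge a$ for some $\mathfrak{f}' \in |\mathfrak{f}|$,

where $\mathfrak{f}$ denotes a canonical divisor.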


If C is not hyperelliptic, by what we have proved, the canonical system has 
the independence property, and thus the result follows from Theorem 1. 

If C is hyperelliptic, we must argue directly. There is then, by definition
of hyperelliptic, a regular map $f\colon C \to P^1$ of C onto the projective line, of
degree 2: that is, $[k(C) : k(P^1)] = 2$. The canonical system on C is composed
of all divisors of the form $f^{*}(x_1 + \cdots + x_{g-1})$, where $\sum x_i$ runs over
the complete linear system of all positive divisors of degree $g-1$ on $P^1$;
namely, this is obviously a linear system, of degree $2g-2$ and dimension
$g-1$, hence necessarily the canonical system.

Let now the $\{x_i\}$ be independent generic points, and let $f^{-1}(x_i) = y_i + y_i'$.
Then the divisors of degree g + i contained in the canonical system are all
specializations of $y_1 + \cdots + y_{g-1} + y_1' + \cdots + y_{i+1}'$: this is trivial to see,
but tedious to write out. The point on C(g+i) representing this divisor is
thus a generic point for $S^{(i)}$, which is therefore irreducible and of dimension
$g-1$.

Part II. 


4. Some subvarieties of C(n). To avoid confusion, in this section only
we shall use p(a) for the point on C(n) representing the divisor a on the
curve C.


There is a natural map
$f\colon C(r) \times C(n-r) \to C(n)$,
defined by $f[p(a^r), p(b^{n-r})] = p(a+b)$. This map is regular, since it is
single-valued and $C(r) \times C(n-r)$ is a normal variety. In fact, it is biregular
on $p(a^r) \times C(n-r)$, as is easily seen.


We now define on C(n) a subvariety denoted by X[a] for all non-negative
divisors a as follows ($\emptyset$ denotes the empty set or divisor):


$X[\emptyset] = C(n)$, $X[a^r] = p(a)$ for $r = n$,
$X[a^r]$ = image of $p(a^r) \times C(n-r)$ under f ($1 \le r \le n-1$).

Thus if r < n, then $X[a^r]$ is biregularly equivalent to C(n-r).
We are interested here in the intersection relations of these subvarieties
given by the next two propositions.

Proposition 1. If a and b have no common points, then X[a] and
X[b] intersect properly on C(n) and $X[a]\cdot X[b] = X[a+b]$.

Proof. Set-theoretically we have under the assumptions evidently
$X[a] \cap X[b] = X[a+b]$,
so that the intersection is proper. To show they intersect with multiplicity
one, suppose first that a and b are independent generic divisors, so that
neither a nor b has repeated points:

$a = p_1 + \cdots + p_r$ and $b = q_1 + \cdots + q_s$.


We use C[n] to denote $C \times C \times \cdots \times C$ (n factors), and consider the
obvious regular map of degree n!
$g\colon C[n] \to C(n)$
defined by $g(x_1, \ldots, x_n) = p(x_1 + \cdots + x_n)$. We claim first of all that
$g^{*}(X[b]) = \sum_\sigma \sigma(q_1 \times \cdots \times q_s \times C[n-s])$,
where the sum is over a set of n!/(n-s)! permutations $\sigma$ of the factors of
C[n] which make the summands on the right all distinct. In fact this
relation is evidently true set-theoretically; all coefficients must be the same
for the summands on the right by reason of symmetry; if this coefficient is m,
we have on applying g to the left side n! X[b], and applying it to the right
side
$m \sum_\sigma g(\sigma(q_1 \times \cdots \times q_s \times C[n-s])) = m\,[n!/(n-s)!]\,(n-s)!\,X[b]$,

whence m = 1.


Now putting $Y[a] = p_1 \times \cdots \times p_r \times C[n-r]$ on C[n], we have
(assuming $r + s \le n$, and defining $C[0] = \emptyset$)

$Y[a]\cdot g^{*}(X[b]) = p_1 \times \cdots \times p_r \times \bigl(\sum_\tau \tau(q_1 \times \cdots \times q_s \times C[n-r-s])\bigr)$,

the sum taken over permutations $\tau$ which make the summands distinct.
Applying g, we get therefore by the projection formula, since
$g(Y[a]) = (n-r)!\,X[a]$,
$g(Y[a])\cdot X[b] = (n-r)!\,X[a]\cdot X[b]$
$= [(n-r)!/(n-r-s)!]\,(n-r-s)!\,X[a+b]$,

so that our result $X[a]\cdot X[b] = X[a+b]$ is proved.
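The bookkeeping behind the multiplicity-one conclusion can be checked numerically. The following is a minimal sketch, not part of the original argument: the function names `count_summands` and `intersection_multiplicity` are hypothetical, and the code only verifies the factorial identity used in the projection-formula step, namely that (number of distinct summands) times (degree of g on each summand) equals (n-r)!.

```python
from fractions import Fraction
from itertools import permutations
from math import factorial


def count_summands(slots, s):
    """Count the distinct ways to place s labelled points into `slots` factors."""
    return sum(1 for _ in permutations(range(slots), s))


def intersection_multiplicity(n, r, s):
    """Coefficient m in X[a].X[b] = m X[a+b] forced by the projection formula."""
    summands = count_summands(n - r, s)        # (n-r)!/(n-r-s)! summands
    degree_on_summand = factorial(n - r - s)   # degree of g on each summand
    # (n-r)! X[a].X[b] = summands * degree_on_summand * X[a+b]
    return Fraction(summands * degree_on_summand, factorial(n - r))
```

For every admissible triple with r + s at most n the ratio comes out as 1, which is the statement that the two varieties meet with multiplicity one.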

If now a' and b' are non-generic divisors, specialize $(a, b) \to (a', b')$;
this extends uniquely in turn to

$a + b \to a' + b'$, $a \times C(n-r) \to a' \times C(n-r)$, $X[a] \to X[a']$,
and so on. Thus since the intersection of specialized positive cycles is the
specialization of the intersection (if all intersections are proper), we get
$X[a']\cdot X[b'] = X[a' + b']$.

Proposition 2. Let $\xi[a]$ denote the rational equivalence class in A(C(n))
of X[a]. Then $\xi[a]\cdot\xi[b] = \xi[a+b]$.

Proof. If $a^r$ and $b^s$ have no common points, this is just a weakening of
Proposition 1. If they do, find finite sets of divisors $\{a_i^r\}$ and $\{a_j'^r\}$ having
no points in common with b such that $p(a) \sim \sum p(a_i) - \sum p(a_j')$, the
rational equivalence being on C(r); it is a question only of avoiding certain
subvarieties of C(r) we shall not make explicit. Then

$X[a] \sim \sum X[a_i] - \sum X[a_j']$
on C(n), since $X[a] = f(p(a) \times C(n-r))$ and rational equivalence is preserved
by regular projective (proper) maps.
Now $\xi[a]\cdot\xi[b]$ is represented by

$\sum X[a_i]\cdot X[b] - \sum X[a_j']\cdot X[b] = \sum X[a_i + b] - \sum X[a_j' + b]$
$\sim X[a+b] \in \xi[a+b]$,
since $p(a+b) \sim \sum p(a_i + b) - \sum p(a_j' + b)$ on C(r+s), this being so
because rational equivalence is preserved by the map $C(r) \times C(s) \to C(r+s)$.


5. Chern classes of a projective bundle. As general references for 
what follows, see [5,10]. 


Suppose that $(E, X, \pi')$ is an algebraic vector bundle, that is, a fiber
space in the sense of André Weil [12] whose fiber is a vector space $V^p$ of
dimension p. In other words, the base space X should be a non-singular variety
covered by open sets $\{U_i\}$ such that $\pi'^{-1}(U_i)$ is biregularly isomorphic to
$V^p \times U_i$ by a fiber-preserving isomorphism (local triviality), and such that
the transition functions $g'_{ij}\colon U_i \cap U_j \to GL(p,k)$ are regular maps into the
general linear group. We may then consider the derived algebraic projective
bundle $(P(E), X, \pi)$, the points of whose fibers $\pi^{-1}(x)$ are the lines through
the origin in the vector space $\pi'^{-1}(x)$. Formally it is given by the transition
functions $g_{ij}\colon U_i \cap U_j \to PGL(p-1,k)$ derived from the natural homomorphism
$GL(p,k) \to PGL(p-1,k)$. Since each point of P(E) represents a
line, P(E) is the base space of a canonically determined line bundle whose
dual bundle is denoted by L; associated with L is then a divisor class $\xi$ in
$A_1(P(E))$.

The natural map $\pi^{*}$ is an isomorphism of A(X) into A(P(E)); call its
image $A(X)^{*}$ and write $c^{*}$ for $\pi^{*}(c)$, if $c \in A(X)$. Then Grothendieck has
proved that


$A(P(E)) = A(X)^{*}[\xi]$,
where the minimal equation for $\xi$ is:

$\xi^p + c_1^{*}\xi^{p-1} + \cdots + c_p^{*} = 0$, $c_i \in A_i(X)$.

The Chern classes of the vector bundle E are now defined to be the $c_i$, and
$1 + c_1 + \cdots + c_p$ is called the total Chern class of E. Note also that it
follows from Grothendieck's result that $A_1(P(E)) = A_0(X)^{*}\cdot\xi + A_1(X)^{*}$.


So far we have started with the vector bundle and derived $\xi$ and the
projective bundle from it. If we begin with a projective bundle, we may ask
to what extent $\xi$ in the above is uniquely determined (or even exists).


Proposition 3. An algebraic projective bundle $(F, X, \pi)$ together with
a given element $\xi \in A_1(F)$ is derived from a vector bundle $(E, X, \pi')$ as
described above if and only if $\xi\cdot\pi^{-1}(x)$ is the class of a hyperplane in the
projective space $\pi^{-1}(x)$, x generic.


Proof. For the necessity, since the restriction L' of L to a fiber
$\pi^{-1}(x) = P^{p-1}$ is just the dual of the natural line bundle on $P^{p-1}$, we see
that L' is associated with the divisor class of a hyperplane section of $P^{p-1}$,
from which it follows that $\xi\cdot\pi^{-1}(x)$ is in $A(\pi^{-1}(x))$ the generating element
represented by a hyperplane in $\pi^{-1}(x)$.


For the sufficiency, Grothendieck [6, § 3.4] has shown that an algebraic
projective bundle is always derived from a vector bundle; suppose therefore
that our bundle F is derived from $(E_0, X, \pi')$ with $\xi_0 \in A_1(F)$ as associated
divisor class, so that $\xi_0\cdot\pi^{-1}(x)$ is the class of a hyperplane in $\pi^{-1}(x)$. If now
$\xi \in A_1(F)$ is any other element with this property, it follows from the structure
of the group $A_1(F)$ as given above that $\xi = \xi_0 + \lambda_1^{*}$, where $\lambda_1 \in A_1(X)$.
Let $L_1$ be the line bundle on X associated with $\lambda_1$; then the vector bundle
$E = E_0 \otimes L_1$ is the desired bundle: it has $\xi$ as associated divisor, and E and
$E_0$ have the same derived projective bundle, namely F.

Though this is all we need, for the sake of clarity we add a few remarks.
It is easy to see that any vector bundle from which F is derived is of the
form $E \otimes L_1$, where $L_1$ is a line bundle on X. If now X is complete, we get
in this way a one-one correspondence between elements of $\xi + A_1(X)^{*}$ and
vector bundles producing F. Now if $\xi$ as above is a root of the polynomial
$f(X) = \sum c_i^{*} X^{p-i}$, then $\xi + \lambda_1^{*}$ is a root of $f(X - \lambda_1^{*}) = \sum d_i^{*} X^{p-i}$, whose
coefficients are polynomials in the $c_i$ and $\lambda_1$ (which are easily calculated) and in
fact the Chern classes of $E \otimes L_1$. In other words, if we envision the elements
$\lambda \in A_1(X)$ as acting on A(X) as an additive group of automorphisms by

$\lambda\colon \sum_{i=0}^{n} c_i \to \sum_{i=0}^{n} d_i$ (n = dim X),

the $d_i$ being determined as above, then what is an invariant of a projective
bundle is the orbit of $\sum c_i$ under the group $A_1(X)$: the elements of the orbit
are in 1-1 correspondence with the Chern classes of the vector bundles
from which F is derived, and the orbit can reasonably be called the Chern
class of F.
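As an illustration of the "easily calculated" coefficients, here is the substitution written out. This is a sketch under the assumption that the polynomial is $f(X) = \sum_i c_i X^{p-i}$ with $c_0 = 1$; asterisks are dropped, and $\lambda$ stands for the class in $A_1(X)$ by which the root is shifted.

```latex
\begin{align*}
f(X - \lambda) &= \sum_{i=0}^{p} c_i\,(X-\lambda)^{p-i}
               = \sum_{j=0}^{p} d_j\,X^{p-j},
\qquad
d_j = \sum_{i=0}^{j} (-1)^{\,j-i}\binom{p-i}{j-i}\,c_i\,\lambda^{\,j-i}. \\[4pt]
\text{For } p = 2:\quad
f(X-\lambda) &= X^2 + (c_1 - 2\lambda)\,X + (c_2 - c_1\lambda + \lambda^2),
\qquad d_1 = c_1 - 2\lambda,\quad d_2 = c_2 - c_1\lambda + \lambda^2 .
\end{align*}
```

In rank 2 the shift changes $c_1$ by a multiple of $\lambda$ and leaves $d_1^2 - 4d_2 = c_1^2 - 4c_2$ unchanged, a small example of a quantity constant along the orbit.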


6. Statement of the result. We fix once and for all a point $p_0 \in C$.
Then according to Section 4, we have a nested sequence of subvarieties of
C(n):

$X[p_0] \supset X[2p_0] \supset \cdots \supset X[np_0]$,

which we shall abbreviate as $X_i = X[ip_0]$, so that $X_i$ is of codimension i on
C(n). If we denote the rational equivalence class of $X_1$ by $\xi$, Proposition 2
shows that $X_i$ represents the class $\xi^i = \xi\cdot\xi\cdots\xi$ (i factors). Each $X_i$ is
biregularly isomorphic to C(n-i).

Now let J be the Jacobian of C. Using our point $p_0$, we fix the canonical
map $\phi\colon C \to J$ by making $\phi(p_0) = e$, the identity point of J. Then by linear
extension of $\phi$ to divisors, we get a map


$\pi\colon C(n) \to J$.


We have proved elsewhere [8] that if n > 2g - 2, which we shall henceforth
assume, this triple $(C(n), J, \pi)$ is naturally an algebraic projective bundle
whose fibers are the linear systems. On it we show now that the subvariety
$X_1$ satisfies the hypotheses of Proposition 3: namely, for x generic on J,
$X_1\cdot\pi^{-1}(x)$ is in the rational equivalence class of a hyperplane on $\pi^{-1}(x)$.
For the divisors making up $X_1 \cap \pi^{-1}(x)$ are those containing $p_0$; these form
a linear subsystem of dimension $n-g-1$, in other words, a hyperplane in
the projective space to which $\pi^{-1}(x)$ is biregularly equivalent. To see that
the intersection multiplicity is one, it is enough to show that (using Proposition 1),
$X[a^{n-g-1} + p_0]\cdot\pi^{-1}(x) = X[a^{n-g-1}]\cdot X_1\cdot\pi^{-1}(x)$ consists of a single point
with multiplicity one, if a is "general." In fact, it consists of the divisors
of $\pi^{-1}(x)$ containing $a^{n-g-1} + p_0$; it is well known that there is only one, and
moreover, since the linear system represented by the points of $\pi^{-1}(x)$ is
rational over k(x) [8, p. 475], this unique divisor will be k(x)-rational and
hence its representative point on C(n) will be k(x)-rational too. Under
these circumstances, the intersection multiplicity at the point is one.

It follows therefore that $\xi$, the rational equivalence class of $X_1$ in
$A_1(C(n))$, is associated with a unique vector bundle E of rank p = n - g + 1
from which the bundle C(n) is derived. We wish to compute the Chern
classes of this bundle.

To this end, we let

$W_i = \pi(X_{n-g+i})$,

so that $W_i$ for $0 \le i < g$ consists of all points on J writable as
$\phi(x_1) + \cdots + \phi(x_{g-i})$, and $W_g = e$, and $W_i = \emptyset$ for i > g. To eliminate
asterisks, we also put


$U_i = W_i^{*} = (W_i^{-})_c$,

that is, the transform of $W_i$ by the biregular map of J sending x into $-x + c$,
where $c = \pi(\mathfrak{f} + (n-2g+2)p_0)$ is the canonical point on J.


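THEOREM 3. As a cycle on C(n), we have

$X_p - \pi^{-1}(U_1)\cdot X_{p-1} + \pi^{-1}(U_2)\cdot X_{p-2} - \cdots + (-1)^g\,\pi^{-1}(U_g)\cdot X_{p-g} = 0$, $p = n - g + 1$.

Letting $u_i$ be the rational equivalence class of $U_i$ in $A_i(J)$ and using
the fact that the class of $X_i$ is $\xi^i$, we get immediately the

COROLLARY. The total Chern class in A(J) of the vector bundle E
with derived projective bundle $(C(n), J, \pi)$, n > 2g - 2, and associated
(n-1)-cycle $X_1 = X[p_0]$ is

$1 - u_1 + u_2 - \cdots + (-1)^g u_g$.
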
7. Proof of Theorem 3. In what follows, in addition to viewing points
of C(n) as divisors on C without further comment, we shall often think of
points on J as divisor classes of degree 0 on C. Thus for example $\pi(np_0) = e$
and $\pi(a) = \mathrm{Cl}(a - np_0)$, where Cl(a) means the divisor class to which a
belongs. For reference, we state explicitly,


(1) The divisors in $X_i$ are those of the form $a^{n-i} + ip_0$.


(2) The classes in $U_i$ are those containing a representative of the form
$\mathfrak{f} - a^{g-i} - (g+i-2)p_0$ for some a ($\mathfrak{f}$ is a canonical divisor).

An essential auxiliary role is played by the (g-1)-dimensional varieties
$S^{(i)}$ representing the special divisors on C(g+i) that we introduced at the
end of Part I. Under the biregular isomorphisms of C(g+i) onto $X_{n-(g+i)}$,
the variety $S^{(i)}$ is carried onto a variety we shall continue to denote by $S^{(i)}$,
so that
(3) $S^{(i)} \subset X_{n-(g+i)}$.
In particular, note the extreme cases: $S^{(-1)} = X_{n-g+1} = X_p$ since every divisor
of degree $g-1$ is special, and $S^{(g-1)}$ together with the higher $S^{(i)}$ are all
empty.

We first note that set-theoretically,

(4) $S^{(i)} = \pi^{-1}(U_{i+2}) \cap X_{n-g-i}$.

This shows, incidentally, that if 1< j= g—2, then SY AS”, To prove it, 
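using (1) and (2) above, we have

$b^{g+i} + (n-g-i)p_0 \in S^{(i)} \iff b^{g+i}$ is special
$\iff |\mathfrak{f} - b|$ contains $a^{g-i-2}$ for some a
$\iff \mathfrak{f} - a \sim b \iff b - (g+i)p_0 \sim \mathfrak{f} - a - (g+i)p_0$
$\iff \mathrm{Cl}(b - (g+i)p_0)$ contains for some a a divisor $\mathfrak{f} - a^{g-i-2} - (g+i)p_0$
$\iff \pi(b + (n-g-i)p_0) \in U_{i+2}$.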
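Our main effort now goes into proving the basic relation

(5) $\pi^{-1}(U_i)\cdot X_{p-i} = S^{(i-2)} + S^{(i-1)}$, $i = 1, \ldots, g$.

From this our theorem follows easily. For writing (5) out for different i,

$\pi^{-1}(U_1)\cdot X_{p-1} = X_p + S^{(0)}$, since $S^{(-1)} = X_p$,
$\cdots$
$\pi^{-1}(U_{g-1})\cdot X_{p-g+1} = S^{(g-3)} + S^{(g-2)}$,
$\pi^{-1}(U_g)\cdot X_{p-g} = S^{(g-2)}$, since $S^{(g-1)} = \emptyset$,

so that alternately adding and subtracting, we get

$X_p - \pi^{-1}(U_1)\cdot X_{p-1} + \cdots + (-1)^g\,\pi^{-1}(U_g)\cdot X_{p-g} = 0$,

which is the desired relation.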
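The telescoping is easiest to see in the smallest interesting case. The following sketch takes g = 2 and assumes relation (5) in the form $\pi^{-1}(U_i)\cdot X_{p-i} = S^{(i-2)} + S^{(i-1)}$, together with the extreme cases $S^{(-1)} = X_p$ and $S^{(g-1)} = \emptyset$ noted earlier.

```latex
\begin{align*}
i = 1:&\quad \pi^{-1}(U_1)\cdot X_{p-1} = X_p + S^{(0)}, \\
i = 2:&\quad \pi^{-1}(U_2)\cdot X_{p-2} = S^{(0)} + S^{(1)} = S^{(0)}. \\[4pt]
\text{Subtracting:}&\quad
X_p - \pi^{-1}(U_1)\cdot X_{p-1} + \pi^{-1}(U_2)\cdot X_{p-2}
 = X_p - \bigl(X_p + S^{(0)}\bigr) + S^{(0)} = 0 .
\end{align*}
```

For general g each $S^{(j)}$ with $0 \le j \le g-2$ occurs in exactly two consecutive relations, with opposite signs in the alternating sum, so only the terms involving the $X_i$ survive.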
Proof of relation (5). We first show the relation is true set-theoretically.
The right hand side is clearly contained in the left, since if we remember
that $X_i \subset X_{i-1}$ by our conventions, we have using (3),

$S^{(i-2)} \subset X_{p-i+1} \subset X_{p-i}$, $S^{(i-1)} \subset X_{p-i}$.

Also, by (4) we have, since $W_{i+1} \subset W_i$,

$\pi(S^{(i-2)}) = U_i$, $\pi(S^{(i-1)}) = U_{i+1} \subset U_i$.

Looking at the reverse inclusion, we have using (1) and (2),

$\pi(a^{g+i-1} + (n-g-i+1)p_0) \in U_i \iff a \sim \mathfrak{f} - b^{g-i} + p_0$ for some b
$\iff a - p_0$ is special.

There are two possibilities: either $\dim a = \dim(a - p_0)$ or else
$\dim a = 1 + \dim(a - p_0)$. In the first case, $p_0$ is a fixed point of |a|, so
$a = a_0 + p_0$, where $a_0$ is special, so that $a_0 + (n-g-i+2)p_0$ is in $S^{(i-2)}$. In the
second case, since $\dim(a - p_0) > i - 2$ (speciality), we deduce $\dim a > i - 1$, so
that a is special, hence $a + (n-g-i+1)p_0$ is in $S^{(i-1)}$.
We have finally to show the relation (5) is true as an intersection formula,
that is, that the coefficients of the right side are both one. Suppose then that


$\pi^{-1}(U_i)\cdot X_{p-i} = a\,S^{(i-2)} + b\,S^{(i-1)}$, a, b > 0.


Let $o^{g-1}$ be a generic divisor of degree g - 1 = n - p. Then by Proposition 1,
$X_{p-i}\cdot X[o]$ is defined and equals $X[o + (p-i)p_0]$. If now we can show that

(i) the cycle $Z = \pi^{-1}(U_i)\cdot X[o + (p-i)p_0]$ is defined, $i = 1, \ldots, g$,

it will follow by the associativity formula that

$Z = a\,S^{(i-2)}\cdot X[o] + b\,S^{(i-1)}\cdot X[o]$.

This, combined with

(ii) Z consists of points occurring all with multiplicity one,

(iii) $S^{(j)} \cap X[o]$ is not empty, $j = -1, \ldots, g-2$,

will then imply that both a and b are one, completing the proof of the
theorem.

To prove statement (i), we have as before using (1) and (2),

$\pi(a^{i} + o + (p-i)p_0) \in U_i \iff a + o \sim \mathfrak{f} - b^{g-i} + p_0$ for some b
$\iff a + b \sim \mathfrak{f} + p_0 - o$.

Now $|\mathfrak{f} + p_0|$ has dimension $g-1$, so that since $o^{g-1}$ is generic,
$\dim |\mathfrak{f} + p_0 - o| = 0$
and the system contains a unique positive divisor of degree g. Thus a + b
is a well-determined divisor of degree g, which means that a must be one of
the (in general) $_gC_i$ divisors contained in it. In other words,
$\pi^{-1}(U_i) \cap X[o + (p-i)p_0]$
consists of a finite set of points.

Going on to statement (ii) now, apply to (i) the map $\pi$ and use the
projection formula, obtaining

$\pi(Z) = U_i\cdot\pi(X[o + (p-i)p_0]) = U_i\cdot(W_{g-i})_z$,

where $z = \pi(o + (n-g+1)p_0) = \pi(o)$. It clearly is sufficient to show that
$\pi(Z)$ has no multiple points, or by performing a translation by $-z$ on J and
remembering $U_i = (W_i^{-})_c$, that $(W_i^{-})_{c-z}\cdot W_{g-i}$ has no multiple points. However,
A. Weil has computed this zero-cycle for us [12, Prop. 17, p. 74], the
result being $\sum (w_{\alpha_1\cdots\alpha_i})$, where the sum is taken over the $_gC_i$ combinations of
indices taken i at a time, the points

$w_{\alpha_1\cdots\alpha_i} = \phi(q_{\alpha_1}) + \cdots + \phi(q_{\alpha_i})$,

and the $q_\alpha$ are defined by $c - z = \sum_{\alpha=1}^{g} \phi(q_\alpha)$; in these last two equations, $\phi$
is the canonical map of C into J, and the sums are taken in the sense of the
group law on J. Our job is therefore to see that the $w_{\alpha_1\cdots\alpha_i}$ are all distinct.

The $q_i$ are determined by the above according to the relation
$\mathfrak{f} - o + p_0 \sim \sum q_i$, so we have to show that no two of the divisors of degree i
contained in $(q_1 + \cdots + q_g)$ are linearly equivalent. The system $|\mathfrak{f} + p_0|$ has
dimension $g-1$ and $p_0$ as fixed point. One divisor of this system is by the above
$o + \sum q_i$; it is in fact a generic divisor (since o is generic), and since $p_0$ is a
fixed point of the system, say $q_g = p_0$. Then $o + (q_1 + \cdots + q_{g-1})$ is a
generic divisor of the canonical system $|\mathfrak{f}|$. But then the points $q_1, \ldots, q_{g-1}$
are independent generic points: in the non-hyperelliptic case because the
canonical system has the independence property, while in the hyperelliptic
case it is clear since a generic canonical divisor of $|\mathfrak{f}|$ is writable $\sum t_i + t_i'$,
where $t_i$ and $t_i'$ are conjugates over a quadratic subfield of k(C). There are
thus three cases. Let $a^i$ and $b^i$ be two different divisors selected from
$(q_1 + \cdots + q_{g-1} + p_0)$. If both contain $p_0$ or if neither does, they cannot
be linearly equivalent since, after subtracting the $p_0$ if necessary, two generic
divisors of degree $\le g$ cannot be linearly equivalent, while if one contains $p_0$
but not the other, it is still impossible because a generic divisor of degree
$\le g$ has dimension zero and is therefore not linearly equivalent to another
divisor of the same degree.

Finally to prove statement (iii), since the canonical system is of dimension
$g-1$, we have (as in the preceding proof) a uniquely determined
canonical divisor of the form $(q_1 + \cdots + q_{g-1}) + o^{g-1}$. Then clearly
$o + (q_1 + \cdots + q_{j+1})$ is a special divisor of degree g + j (if j = -1, take
just o), and so $S^{(j)} \cap X[o]$ cannot be empty because it contains


$o + (q_1 + \cdots + q_{j+1}) + (n - g - j)p_0$.


8. The Euler characteristic. Out of curiosity, and in order to sneak 
Newton into this paper, we compute the Euler characteristic of the vector 
bundle whose Chern classes we have just determined in Theorem 3. 

The Chern classes of J (that is, the Chern classes of the tangent bundle
to J) are trivial, since the tangent bundle is trivial: it is enough to show
that the dual bundle of 1-forms is trivial, but this is evident because it has
a basis at every point consisting of the g linearly independent invariant
regular simple differentials on J. The Riemann-Roch-Hirzebruch formula
[2] thus reduces to

$\chi = \kappa_g[e^{\delta_1} + \cdots + e^{\delta_p}]$,

where the $\delta_i$ are defined formally, by the Corollary to Theorem 3, by


$1 - u_1 + u_2 - \cdots + (-1)^g u_g = (1 + \delta_1)\cdots(1 + \delta_p)$,
$\delta_i = 0$, $i > g$.


Expanding the exponential series and looking just at the term of weight g,
we get


$\chi = \kappa_g[(1/g!)(\delta_1^g + \cdots + \delta_g^g)]$.


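Letting $s_k = \delta_1^k + \cdots + \delta_g^k$, we now invoke, for $k = 1, \ldots, g$, [9],

Newton's identities: $s_k + u_1 s_{k-1} + \cdots + u_{k-1}s_1 = -k u_k$.


These give a system of linear equations for determining the $s_i$, whose solution
for $s_g$ is by Cramer's rule,

$s_g = \det\begin{pmatrix} -g u_g & u_1 & \cdots & u_{g-1} \\ -(g-1)u_{g-1} & 1 & \cdots & u_{g-2} \\ \vdots & \vdots & & \vdots \\ -u_1 & 0 & \cdots & 1 \end{pmatrix}$.

According now to intersection formulas of Matsusaka and Weil modulo
numerical equivalence [7], $u_i = u_1^i/i!$, from which we deduce, if $i + j \le g$,

$u_i u_j = {}_{i+j}C_i\, u_{i+j}$.

All the terms in the above determinant, when it is expanded out, are indeed
of weight g, and we get therefore

$\chi = \kappa_g[(1/g!)\,s_g] = \det\begin{pmatrix} -1/(g-1)! & 1 & \cdots & 1/(g-1)! \\ -1/(g-2)! & 1 & \cdots & 1/(g-2)! \\ \vdots & \vdots & & \vdots \\ -1 & 0 & \cdots & 1 \end{pmatrix}$.

Thus $\chi = 0$, since the first and last columns differ by a sign.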
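The vanishing can be checked by machine for small genus. The sketch below is an illustration, not the author's computation: it assumes the Cramer determinant has rows given by Newton's identities for k = g, ..., 1 (first column the right-hand sides $-k u_k$, remaining columns the coefficients of $s_{g-1}, \ldots, s_1$), and that each $u_i$ may be replaced by 1/i! after factoring out $u_g$. The names `euler_matrix` and `det` are hypothetical helpers.

```python
from fractions import Fraction
from math import factorial


def euler_matrix(g):
    """Cramer matrix for s_g with every u_i replaced by 1/i! (u_0 = 1)."""
    rows = []
    for k in range(g, 0, -1):
        row = [Fraction(-k, factorial(k))]      # right-hand side -k*u_k
        for j in range(1, g):
            idx = k - g + j                     # coefficient of s_{g-j} is u_{k-g+j}
            row.append(Fraction(1, factorial(idx)) if idx >= 0 else Fraction(0))
        rows.append(row)
    return rows


def det(matrix):
    """Exact determinant by Gaussian elimination over the rationals."""
    m = [row[:] for row in matrix]
    n = len(m)
    result = Fraction(1)
    for c in range(n):
        pivot = next((r for r in range(c, n) if m[r][c] != 0), None)
        if pivot is None:
            return Fraction(0)
        if pivot != c:
            m[c], m[pivot] = m[pivot], m[c]
            result = -result
        result *= m[c][c]
        for r in range(c + 1, n):
            factor = m[r][c] / m[c][c]
            for cc in range(c, n):
                m[r][cc] -= factor * m[c][cc]
    return result
```

For g of at least 2 the first and last columns are negatives of each other, so the determinant is zero. For g = 1 the matrix has a single entry, and under these assumptions the sketch returns -1.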


9. Some relations in A(J). Quite generally, if $(P(E), X, \pi)$ is a
projective bundle derived from a vector bundle E of rank p over a base space
X of dimension g and $\xi \in A_1(P(E))$ is the associated divisor class, we can
deduce trivially from the Chern relation

$\xi^p + c_1^{*}\xi^{p-1} + \cdots + c_p^{*} = 0$

some relations in A(X). Namely, multiply the relation through by $\xi^i$,
$i = 0, \ldots, g-1$, project onto X and use the projection formula; this gives
a set of relations which may be summarized as

$(1 + c_1 + \cdots + c_g)(1 + \pi(\xi^p) + \cdots + \pi(\xi^{p+g-1})) = 1$,

in view of the fact that $\pi(\xi^{p-1}) = 1$, $\pi(\xi^j) = 0$ if j < p - 1.

Applying this to our situation, we have $\pi(\xi^{p+i}) = w_{i+1}$, and so


$(1 - u_1 + u_2 - \cdots + (-1)^g u_g)(1 + w_1 + w_2 + \cdots + w_g) = 1$;


written in terms of the cycles, this becomes


THEOREM 4. If $w_i$ is the rational equivalence class of $W_i$ (in Weil's
notation $W_{g-i}$) on J and $u_i$ the class of $W_i^{*} = (W_i^{-})_c$, then


$w_1 - u_1 = 0$,
$w_2 - w_1 u_1 + u_2 = 0$,
$\cdots$
$w_g - w_{g-1}u_1 + \cdots + (-1)^g u_g = 0$.


These express therefore the $u_i$ in terms of the $w_i$. The nature of the
relations suggests that if one applies the map $\sigma\colon x \to -x + c$ to J, the
resulting projective bundle $(C(n), J, \sigma\pi)$, whose Chern classes are $(-1)^i w_i$
(or rather, perhaps its dual), should be in some sense the "opposite" bundle
to $(C(n), J, \pi)$.
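A quick consistency check on these relations is available modulo numerical equivalence. The sketch below assumes (this is not stated in the theorem itself) that both $w_i$ and $u_i$ may be replaced by $\theta^i/i!$ for a single divisor class $\theta$, as in the Poincaré-type formulas used for the Euler characteristic. The k-th relation then reduces to the vanishing of the coefficient of $\theta^k$ in $e^{\theta}e^{-\theta}$. The function name `theorem4_coefficient` is hypothetical.

```python
from fractions import Fraction
from math import factorial


def theorem4_coefficient(k):
    """Coefficient of theta^k in w_k - w_{k-1} u_1 + ... + (-1)^k u_k
    after substituting w_i = u_i = theta^i / i!."""
    return sum(
        Fraction((-1) ** j, factorial(k - j) * factorial(j))
        for j in range(k + 1)
    )
```

Under this substitution every relation with k of at least 1 holds identically, and the constant term is 1, in agreement with the product formula of Section 9.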


MASSACHUSETTS INSTITUTE OF TECHNOLOGY. 


REFERENCES. 


[1] A. Andreotti, "On a Theorem of Torelli," American Journal of Mathematics, vol.
80 (1958), pp. 801-828.

[2] A. Borel and J. P. Serre, "Le théorème de Riemann-Roch," Bulletin de la Société
Mathématique de France, vol. 86 (1958), pp. 97-136.

[3] W.-L. Chow, "The Jacobian variety of an algebraic curve," American Journal of
Mathematics, vol. 76 (1954), pp. 453-476.

[4] ---, "Remarks on my paper, 'The Jacobian variety of an algebraic curve,'"
ibid., vol. 80 (1958), pp. 238-240.

[5] A. Grothendieck, "La théorie des classes de Chern," Bulletin de la Société
Mathématique de France, vol. 86 (1958), pp. 137-154.

[6] ---, "Sur quelques points d'algèbre homologique," Tôhoku Mathematical
Journal, 2nd series, vol. 9 (1957), pp. 119-221.

[7] T. Matsusaka, "On a characterization of a Jacobian variety," Memoirs of the
College of Science, University of Kyoto, Series A, vol. 32, Math. no. 1
(1959), pp. 1-19.

[8] A. Mattuck, "Picard bundles," to appear.

[9] I. Newton, Arithmetica Universalis, 1707.

[10] G. Washnitzer, "The characteristic classes of an algebraic fiber bundle, I,"
Proceedings of the National Academy of Sciences, vol. 42 (1956), pp. 433-436.

[11] A. Weil, Fiber spaces in Algebraic Geometry, mimeographed notes, University of
Chicago, 1952.

[12] ---, Variétés Abéliennes et Courbes Algébriques, Paris, 1948.




CORRECTION TO "APPLICATIONS OF THE THEORY OF MORSE
TO SYMMETRIC SPACES" (This Journal, vol. 80 (1958), pp. 964-1029).*


By Raoul Bott and Hans Samelson.


It has been pointed out to us by H. Seifert that our characterization of
simplices "hanging over critical points" in the paper cited above (conditions
(a) (b) (c) on p. 978) is insufficient and that therefore our description of
the theory of Morse in Proposition 8.3, p. 978, is incorrect as stated. If the
space $\Omega$ were a manifold, one would only have to add the requirement that
the singular simplices in question be differentiable and non-degenerate (have
non-singular differential, at least at the barycenter). But in a function space
the description of the associated simplices is more complicated. One of
Morse's procedures (cf. [16, pp. 38, 56, 58; 17, pp. 52, 81]) is to introduce
a set of cross manifolds $Q_k$ (including the end-manifold) along the geodesic
segment s, and with their help to imbed the point s of $\Omega$ into a manifold
$P_s = \prod Q_k$ of geodesic polygons, contained in $\Omega$. The function $L \mid P_s$ is
differentiable and has a non-degenerate singularity of index $\lambda_s$ at s. The main fact is now
that any singular simplex associated to s in $P_s$ also serves as associated
simplex in $\Omega$. With this in mind we shall prove below that the singular
simplex (o3,¢;) of p. 984, 1. 2% is indeed associated to s. (This is where 
we had used our incorrect characterization.) The remainder of the proof on 
pp. 984, 985 is unchanged. Proposition 10.2(d), p. 981, and its proof in 
Section 11, pp. 983, 984, become unnecessary. 

We may and shall assume that among the $Q_k$ there is one at each exceptional
point $s(t_i)$ and one at some point $s(\bar t_i)$ with $\bar t_i$ in $(t_i, t_{i+1})$ for
$i = 1, \ldots, m$; further each cross manifold contains locally the orbit of the point
to which it is attached.

The map f,: T;—Q, restricted to a small neighborhood of w,, factors 
then through a C*-map h, into P,. 


LEMMA (a). $h_s$ is non-degenerate at $w_s$.


Proof. Let X be a non-zero vector of T, at ws; let Y be a vector of W, 
at (¢,---,e) with ¥.(Y)—X. We write Y—(Y;, --,Yn), with Y; a 
vector of K;. Let Q be a cross manifold, attached to some é;. The image of 
X under the composition of h,, projection of P, onto its factor Q, and inclusion 
of Q in M is easily found to be (¥,-+- - -+ Yi) (s(&)) (cf. def. 1.6 for Y). 


* Received December 20, 1960. 


If 7 is the smallest value for which Y; is not tangent to the subgroup K, of K; 
(this exists since K ~0), the vector so obtained is not zero, since i, is not 


exceptional, and Lemma (a) is proved. 


Next we consider a local lemma. Let f be a $C^\infty$-function on a neighborhood
U of a point p in a manifold of dimension m, with p as only critical
point, non-degenerate, of index $\lambda$. Write A for $\{x \in U : f(x) \le f(p)\}$, $A^{-}$ for
$\{x \in U : f(x) < f(p)\}$, $A_0$ for $\{x \in U : f(x) = f(p)\}$.
Let $\mu\colon \sigma \to U$ be a $C^\infty$-singular simplex of dimension $\lambda$ such that


(i) $\mu^{-1}(p)$ consists of just the barycenter b of $\sigma$,
(ii) $f\mu$ is constant, i.e., $\mu(\sigma) \subset A_0$,
(iii) the differential of $\mu$ is non-singular at b.


LEMMA (b). Under the above hypothesis, $\mu$ represents a generator of
$H_\lambda(A, A - \{p\})$, for any coefficient group. Further, if $\mu'\colon \sigma \to U$ is a singular
simplex sufficiently close to $\mu$, with $\mu'(\sigma) \subset A^{-} \cup \{p\}$, then $\mu'$ is a generator
of $H_\lambda(A^{-} \cup \{p\}, A^{-})$, again for any coefficient group, i.e. it serves as associated
simplex for the critical point p.


Proof. We may assume (by [16]) that U = Euclidean space $E^m$, that p
is the origin and that f has the form $-x_1^2 - \cdots - x_\lambda^2 + x_{\lambda+1}^2 + \cdots + x_m^2$.
Both pairs $(A, A - \{p\})$ and $(A^{-} \cup \{p\}, A^{-})$ have as deformation retract the
pair $(E^\lambda, E^\lambda - \{p\})$, where $E^\lambda$ is the subspace of $E^m$ spanned by the first $\lambda$
axes; the retraction map is identical with the projection $\theta$ along the orthogonal
complement of $E^\lambda$. Because of (iii) the "light cone" $A_0$ contains linear
subspaces of dimension $\lambda$; any such space maps in non-degenerate fashion
under $\theta$. The differential of $\theta \circ \mu$ is therefore non-degenerate at b, and the
lemma follows by standard arguments. Incidentally, $\lambda$ is necessarily $\le m/2$.


We come now to the proof that the simplex (¢s,¢;) of no. 12, p. 984, 
is associated to s. The simplex hs°p,: o, > P, (defined if ps(o,) is small 
enough) satisfies the hypotheses of Lemma (b) ; (i) is clear, (ii) follows from 
the second sentence on p. 981, and (iii) is implied by Lemma (a). The 
standard retraction of a neighborhood of s (in Q) into P, sends the singular 
simplex ¢, into a singular simplex ¢,’ with ¢,’(o,)C P,. Since the L-value 
does not increase under the retraction, Lemma (b) applies, so that ¢,’, being 
close to hs°p, for small wu, is associated to s in P,; but then ¢, is associated 
to s in Q. Q. E. D. 


HARVARD UNIVERSITY, 
INSTITUTE FOR ADVANCED STUDY, 
AND STANFORD UNIVERSITY. 




ON THE PREPARATION OF MANUSCRIPTS. 


The following instructions are suggested or dictated by the necessities of the technical pro- 
duction of the American Journal of Mathematics. Authors are urged to comply with these 
instructions, which have been prepared in their interests. 

Manuscripts not complying with the standards usually have to be returned to the authors
for typographic explanation or revisions and the resulting delay often necessitates the
deferment of the publication of the paper to a later issue of the Journal.


Horizontal fraction signs should be avoided. Instead of them, use either solidus signs / or 


negative exponents. 
Neither a solidus nor a negative exponent is needed in the symbols 3, ag has which are 


available in regular size type. 
Binomial coefficients should be denoted by C," and not by parentheses. Correspondingly, 
for symbols of the type of a quadratic residue character the use of some non-vertical arrange- 


ment is usually imperative. 
For square roots use either the exponent ½ or the sign √ without the top line, as in √-1
or √(a + b).

Replace e‘ > by exp( ) if the expression in the parenthesis is complicated. 

By an appropriate choice of notations, avoid unnecessary displays. 

Simple formulae, such as A + iB = }C* or 8, =4@,+.- - --+ Gp, should not be displayed 
(unless they need a formula number). 

Use ’ or d /da, possibly D, but preferably not a dot, in order to denote ordinary differen- 
tiation and, as far as possible, a subscript in order to denote partial differentiation (when the 
symbol @ cannot be avoided, it should be used as 9 /da). 

Commas between indices are usually superfluous and should be avoided if possible. 

In a determinant use a notation which reduces it to the form det a,,. 

Subscripts and superscripts cannot be printed in the same vertical column, hence the 
manuscript should be clear on whether a,* or a*, is preferred. (Correspondingly, the limits of 
summation must not be typed after the =-sign, unless either 2.” or 2": is desired.) If a letter 
carrying a subscript has a prime, indicate whether a,’ or a’, is desired. 

Experience shows that a tilde or anything else over a letter is very unsatisfactory. Such
symbols often drop out of the type after proof-reading and, when they do not, they usually
appear uneven in print. For these reasons we advise against their use. This advice applies
also to a bar over a Greek or German letter (for the symbol of complex conjugation an asterisk
is often allowed by the context). Type carrying bars over ordinary size italic letters of the
Latin alphabet is available.

Bars reaching over several letters should in any case be avoided (in particular, type 
lim sup and lim inf instead of lim with upper and lower bars). 

Repeated subscripts and superscripts should be used only when they cannot be avoided,
since the index of the principal index usually appears about as large as the principal index.
Bars and other devices over indices cannot be supplied. On the other hand, an asterisk or a 
prime (to be printed after the subscript) is possible on a subscript. The same holds true for 
superscripts. 

Distinguish carefully between l. c. "oh," cap. "oh" and zero. One way of distinguishing
them is by underlining one or two of them in different colors and explaining the meaning of
the colors.

Distinguish between e (epsilon) and € or e (symbol), between @ (eks) and X (multipli- 
cation sign), between 1. c. and cap. phi, between I. c. and cap. psi, between I. c. and kappa and 
between “ell” and “one” (for the latter, use t and | respectively). 

Avoid unnecessary footnotes. For instance, references can be incorporated into the text 
(parenthetically, when necessary) by quoting the number in the bibliographic list, which 
appear at the end of the paper. Thus: “ [3], pp. 261-266.” 

Except when informality in referring to papers or books is called for by the context, the 
following form is preferred: 

[3] O. K. Blank, “ Zur Theorie des Untermengenraumes der abstrakten Leermenge,” Bulletin 
de la Societé Philharmonique de Zanzibar, vol. 26 (1891), pp. 242-270. 
In any case, the references should be precise, unambiguous and intelligible. 

Usually section numbers and section titles are printed in bold face, the titles "Theorem,"
"Lemma" and "Corollary" are in caps and small caps, "Proof," "Remark" and "Definition"
are in italics. This (or a corresponding preference) should be marked in the manuscript.
Use a period, and not a colon, after the titles Theorem, Lemma, etc. 

German, script and bold face letters should be underlined in various colors and the meaning 
of the colors should be explained. The same device is needed for Greek letters if there is a 
chance of ambiguity. In general, mark all cap. Greek letters. 


All instructions and explanations for the printer can conveniently be collected on a 
separate sheet, to be attached to the manuscript. 


In case of doubt, recent issues of the Journal may be consulted. 


THE JOHNS HOPKINS PRESS BALTIMORE 18 


Prices Effective January 1960 

American Journal of Hygiene. Edited by ABRAHAM G. OSLER, Managing Editor, A. M.
BAETJER, MANFRED M. MAYER, R. M. HERRIOTT, F. B. BANG, P. E. SARTWELL, and
ERNEST L. STEBBINS. Publishing two volumes of three numbers each year,
volume 69 is now in progress. Subscription $12 per year. (Foreign postage, 50
cents; Canadian postage, 25 cents.)

American Journal of Mathematics. Edited by W. L. CHOW, J. A. DIEUDONNE, A. M.
GLEASON and PHILIP HARTMAN. Quarterly. Volume 81 in progress. $11.00 per
year. (Foreign postage, 60 cents; Canadian, 30 cents.) 

American Journal of Philology. Edited by H. T. ROWELL, LUDWIG EDELSTEIN, JAMES
W. POULTNEY, JOHN H. YOUNG, JAMES H. OLIVER, and EVELYN H. CLIFT, Secretary.
Volume 85 is in progress. $6.00 per year. (Foreign postage, 50 cents;
Canadian, 25 cents.)

Bulletin of the History of Medicine. OWSEI TEMKIN, Editor. Bi-monthly. Volume 33
in progress. Subscription $6 per year. (Foreign postage, 50 cents; Canadian,
25 cents.)

Bulletin of the Johns Hopkins Hospital. PHILIP F. WAGLEY, Managing Editor. Monthly.
Subscription $10.00 per year. Volume 104 is in progress. (Foreign postage, 50 
cents; Canadian, 25 cents.) 

ELH. A Journal of English Literary History. Edited by D. C. ALLEN (Senior Editor),
G. E. BENTLEY, JACKSON I. COPE, RICHARD HAMILTON GREEN, J. HILLIS MILLER,
ROY H. PEARCE, and E. R. WASSERMAN. Quarterly. Volume 25 in progress.
$6.00 per year. (Foreign postage, 40 cents; Canadian, 20 cents.) 

Johns Hopkins Studies in Romance Literatures and Languages. Seventy-six numbers 
have been published. 

Johns Hopkins University Studies in Archaeology. Thirty-nine volumes have appeared. 

Johns Hopkins University Studies in Geology. Seventeen numbers have been published. 

Johns Hopkins University Studies in Historical and Political Science. Under the direc- 
tion of the Departments of History, Political Economy and Political Science. 
Volume 77 in progress. $6.50. 

Modern Language Notes. NATHAN EDELMAN, General Editor. Eight times yearly.
Volume 74 in progress. $8.00 per year. (Foreign postage, 60 cents; Canadian, 
30 cents.) 


A complete list of publications will be sent on request 


There was published in 1956 a 64-page 
INDEX 
to volumes 51-75 (1929-1953) of 


THE AMERICAN JOURNAL OF MATHEMATICS 


The price is $2.50. A 60-page Index to volumes 1-50 (1879-1928), 
published in 1932, is also available. The price is $3.00. 
Copies may be ordered from The Johns Hopkins Press, Baltimore 18, Maryland.


